(54) 4-Oxo-quinazolines as LXR nuclear receptor binding compounds



(57) The present invention relates to 4-oxo-quinazolines which bind to the Liver X receptors (LXR receptors,
LXRalpha/NR1H3 and LXRbeta/NR1H2) and act as selective agonists of the LXR receptors. The invention further
relates to the treatment of diseases and/or conditions through binding of said nuclear receptors and selective
agonistic effects by said compounds and the production of medicaments using said compounds. In particular the
compounds are useful in the treatment of hypercholesterolemia, obesity or other diseases associated with
elevated lipoprotein (LDL) levels.
Description

[0001] Liver X Receptor (LXR) is a prototypical type 2 nuclear receptor which activates genes upon binding to the
promoter region of target genes in a prototypical heterodimeric fashion with Retinoid X Receptor (hereinafter RXR, Forman
et al., Cell, 81, 687-93, 1995). The term LXR includes all subtypes of this receptor. Specifically LXR includes LXRa
(also known as LXRalpha, RLD-1 and NR1H3) and LXRb (also known as LXRbeta, NER, NER1, UR, OR-1, RIP15
and NR1H2) and ligands of LXR should be understood to include ligands of LXRa or LXRb. The relevant physiological
ligands of LXR seem to be oxidized derivatives of cholesterol, including 22-hydroxycholesterol and
24(S),25-epoxycholesterol (Lehmann, et al., J. Biol. Chem. 272(6), 3137-40, 1997). The oxysterol ligands bound to LXR were
found to regulate the expression of several genes that participate in cholesterol metabolism (Janowski, et al., Nature, 383,
728-31, 1996).

[0002] LXR is proposed to be a hepatic oxysterol sensor. Upon activation (e.g. binding of oxysterols) it influences 
the conversion of dietary cholesterol into bile acids by upregulating the transcription of key genes which are involved 
in bile acid synthesis such as CYP7A1. Hence, activation of LXR in the liver could result in an increased synthesis of
bile acids from cholesterol which could lead to decreased levels of hepatic cholesterol. This proposed LXR function in
hepatic cholesterol metabolism was experimentally confirmed using knockout mice. Mice lacking the receptor LXRa 
lost their ability to respond normally to an increase in dietary cholesterol and did not induce transcription of the gene 
encoding CYP7A1. This resulted in accumulation of large quantities of cholesterol in the livers and impaired hepatic 
function (Peet, et al., Cell, 93, 693-704, 1998). 

[0003] Besides its important function in liver, LXR plays an important role in the regulation of cholesterol homeostasis
in macrophages and intestinal mucosa cells where it upregulates cholesterol transporters from the ABC (=ATP binding
cassette) family of membrane proteins (Repa, et al., J Biol Chem. 2002 May 24;277(21):18793-800).
[0004] These transporters are believed to be crucially involved in the uptake of cholesterol from the diet since
mutations in their genes lead to diseases such as sitosterolemia (Berge, et al., Science (2000);290(5497):1771-5).

[0005] Other members of the ABC transporter family seem to be responsible for the efflux of cholesterol from loaded
macrophages, a process which is thought to prevent the generation of atherosclerotic lesions. Stimulation of LXR by
synthetic ligands might result in an increased cholesterol efflux from macrophages and a decreased build-up of
cholesterol-loaded atherosclerotic plaques (Venkateswaran, et al., PNAS (2000) 97(22):12097-102; Sparrow, et al.,
J Biol Chem (2002) 277(12):10021-7; Joseph, et al., PNAS (2002) 99(11):7604-9). Direct evidence that synthetic LXR
ligands inhibit the development of atherosclerosis has been provided in two animal models of atherosclerosis: a
significant reduction in the formation of atherosclerotic plaques was shown in two studies using full LXR agonists
(Joseph et al., PNAS (2002) 99:7604-9; Terasaka et al., FEBS Lett. (2003) 536:6-11). In addition, two recent reports
have highlighted the potential use of LXR agonists in diabetes (Cao et al., (2003) J Biol Chem. 278:1131-6) and
inflammatory disorders (Joseph et al., (2003) Nat Med. 9:213-9).

[0006] However, in animal studies it was observed that activation of LXR in the liver by full agonists like T0901317
does not only increase bile acid synthesis but also stimulates the de novo synthesis of fatty acids and triglycerides
through the upregulation of key enzymes such as Fatty Acid Synthase (FAS) or Stearoyl-CoA Desaturase (SCD-1)
(Schultz, et al., Genes Dev (2000) 14(22):2831-8). Elevation of serum triglyceride levels is an independent risk factor for
atherosclerosis (for review see Miller (1999) Hosp Pract (Off Ed) 34: 67-73).

[0007] Thus, LXR activity needs to be selectively modulated for therapeutic benefit. In particular, compounds need
to be found that stimulate reverse cholesterol transport, but do not significantly increase triglyceride levels. This might
be particularly relevant for the usage of such compounds in diabetic patients since an even more severe lipogenic effect
was reported for the full agonist T0901317 in db/db mice which serve as an animal model for diabetes (Chisholm et
al. (2003) J. Lipid Res (epub August 16)).

[0008] Therefore, an ideal synthetic LXR binding compound should have properties that retain the agonistic activity
on hepatic bile acid formation and ABC-transporter-mediated decrease in cholesterol uptake from the diet and
increased cholesterol efflux from macrophages. In parallel such a compound should lack the hyperlipidemic potential
which is exerted through increased fatty acid and triglyceride synthesis.

[0009] To date only a few compounds have been described which bind the LXR receptor and thus show utility for
treating diseases or conditions which are due to or influenced by said nuclear receptor (Collins, et al., J Med Chem.
(2002) 45(10):1963-6; Schultz, et al., Genes Dev (2000) 14(22):2831-8; Sparrow, et al., J Biol Chem (2002) 277(12):
10021-7). No non-steroidal compounds have so far been described which show selectivity regarding the induction of
ABC transporter genes without simultaneous induction of lipogenic genes like FAS and SREBP-1c (Kaneko et al. (2003)
J Biol Chem (epub July 7)).

[0010] It is thus an object of the invention to provide for compounds which by means of binding the LXR receptor
act as partial agonists of said receptor with a selective property regarding the upregulation of genes like the ABC
transporters in macrophages and/or other cell types and a strongly reduced liability to increase the expression of genes
involved in triglyceride synthetic pathways (like FAS and SREBP-1c). These compounds should show utility for treating
diseases or conditions which are due to or influenced by said nuclear receptor.

[0011] It is further an object of the invention to provide for compounds that may be used for the manufacture of a 
medicament for the treatment of cholesterol associated conditions or diseases. It is still a further object of the invention 
to provide for compounds that lower serum cholesterol and/or increase High Density Lipoproteins (HDL) and/or
decrease Low Density Lipoproteins (LDL). It is also an object of the invention to provide for compounds that may be used
for the treatment of lipid disorders including hypercholesterolemia, atherosclerosis, Alzheimer's disease, skin disorders, 
inflammation, obesity and diabetes. 

[0012] The present invention provides, inter alia, novel LXR nuclear receptor protein binding compounds according 
to the general formula (I) shown below. Said compounds are also binders of mammalian homologues of said receptor. 
The object of the invention is further achieved by providing, among the LXR nuclear receptor protein binding
compounds according to the general formula (I), compounds which act as partial agonists or mixed agonists /
antagonists of the human LXR receptor or a mammalian homologue thereof. The object is likewise achieved by
providing, among the LXR receptor protein binding compounds according to the general formula (I), compounds
which act as partial agonists of the human LXR receptor and therefore induce ABC transporter proteins such as
ABCA1 or ABCG1 in cell types such as macrophages, while lacking a strong potential to induce
genes involved in triglyceride synthetic pathways such as fatty acid synthase (FAS) or SREBP-1c.
[0013] The invention provides for LXR agonists that may be used for the manufacture of a medicament for the
treatment of cholesterol associated conditions or diseases. In a preferred embodiment compounds are provided that lower
serum cholesterol and/or increase High Density Lipoproteins (HDL) and/or decrease Low Density Lipoproteins (LDL).
Also compounds are provided that may be used for the treatment of lipid disorders including hypercholesterolemia, 
atherosclerosis, Alzheimer's disease, skin disorders, inflammation, obesity and diabetes. 

[0014] The invention provides for a compound of the formula (I), or pharmaceutically acceptable salts or solvates
thereof, hereinafter also referred to as the "compounds according to the invention" including particular and preferred 
embodiments thereof. 



The compounds of the invention can also exist as solvates and hydrates. Thus, these compounds may crystallize with, 
for example, waters of hydration, or one, a number of, or any fraction thereof of molecules of the mother liquor solvent. 
The solvates and hydrates of such compounds are included within the scope of this invention. 
[0015] In one embodiment of the invention in formula (I) above, R1, R2, R3 and R4 are, independently of each other, H,
halogen, hydroxy, protected hydroxy, cyano, nitro, C1 to C6 alkyl, C1 to C6 substituted alkyl, C1 to C7 alkoxy, C1 to C7
substituted alkoxy, C1 to C7 acyl, C1 to C7 substituted acyl, C1 to C7 acyloxy, carboxy, protected carboxy, carboxymethyl,
protected carboxymethyl, hydroxymethyl, protected hydroxymethyl, amino, protected amino, (monosubstituted)amino,
protected (monosubstituted)amino, (disubstituted)amino, carboxamide, protected carboxamide,
N-(C1 to C6 alkyl)carboxamide, protected N-(C1 to C6 alkyl)carboxamide, N,N-di(C1 to C6 alkyl)carboxamide, trifluoromethyl,
N-((C1 to C6 alkyl)sulfonyl)amino, N-(phenylsulfonyl)amino or substituted or unsubstituted phenyl, for example a biphenyl,
R5 is H, C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to C8 substituted cycloalkyl, aryl, substituted
aryl, C1 to C6 alkylphenyl or C7 to C12 substituted phenylalkyl,

R6 is H, a secondary or a tertiary amine, C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to C8 substituted
cycloalkyl, aryl, substituted aryl, C1 to C6 alkylphenyl or C7 to C12 substituted phenylalkyl.
[0016] Formula (II) shows another embodiment of the invention

Formula (I) (structural drawing not reproduced)

Formula (II) (structural drawing not reproduced)

wherein R7 is C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to C8 substituted cycloalkyl, aryl,
substituted aryl, C1 to C6 alkylphenyl or C7 to C12 substituted phenylalkyl
[0017] Another embodiment of the invention is described in formula (III) 





R1, R2, R3, R4 are H, halogen, hydroxy, protected hydroxy, cyano, nitro, C1 to C6 alkyl, C1 to C6 substituted alkyl,
C1 to C7 alkoxy, C1 to C7 substituted alkoxy, C1 to C7 acyl, C1 to C7 substituted acyl, C1 to C7 acyloxy, carboxy,
protected carboxy, carboxymethyl, protected carboxymethyl, hydroxymethyl, protected hydroxymethyl, amino,
protected amino, (monosubstituted)amino, protected (monosubstituted)amino, (disubstituted)amino, carboxamide,
protected carboxamide, N-(C1 to C6 alkyl)carboxamide, protected N-(C1 to C6 alkyl)carboxamide,
N,N-di(C1 to C6 alkyl)carboxamide, trifluoromethyl, N-((C1 to C6 alkyl)sulfonyl)amino, N-(phenylsulfonyl)amino or phenyl,

R5 in formula (III) is H, C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to C8 substituted cycloalkyl,
aryl, substituted aryl, C1 to C6 alkylphenyl or C7 to C12 substituted phenylalkyl,

R8 in formula (III) is H, C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to C8 substituted cycloalkyl,
C1 to C7 acyl, C1 to C7 substituted acyl, aryl, substituted aryl, C1 to C6 alkylphenyl or C7 to C12 substituted
phenylalkyl, carboxamide, protected carboxamide, N-(C1 to C6 alkyl)carboxamide, protected
N-(C1 to C6 alkyl)carboxamide, N,N-di(C1 to C6 alkyl)carboxamide, trifluoromethyl, N-((C1 to C6 alkyl)sulfonyl)amino,
N-(phenylsulfonyl)amino or phenyl,

R9 in formula (III) is H, C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to C8 substituted cycloalkyl,
C1 to C7 acyl, C1 to C7 substituted acyl, aryl, substituted aryl, C1 to C6 alkylphenyl or C7 to C12 substituted
phenylalkyl, carboxamide, protected carboxamide, N-(C1 to C6 alkyl)carboxamide, protected
N-(C1 to C6 alkyl)carboxamide, N,N-di(C1 to C6 alkyl)carboxamide, trifluoromethyl, N-((C1 to C6 alkyl)sulfonyl)amino,
N-(phenylsulfonyl)amino or phenyl, or

R8 and R9 in formula (III) are taken together with the N in formula (III) to form a ring of the following types:

C3 to C8 mono-, di- or tri-heteroatom-substituted cycloalkyl such as but not limited to piperidino, morpholino,
thiomorpholino, piperazino, or C1 to C6 alkyl-, C1 to C6 substituted alkyl-substituted or benzo-condensed
derivatives thereof

C4 to C10 mono-, di- or tri-heteroatom-substituted bi- or tricycloalkyl or C1 to C6 alkyl-, C1 to C6 substituted
alkyl-substituted or benzo-condensed derivatives thereof

C3 to C8 mono-, di- or tri-heteroatom-substituted heteroaryl such as but not limited to pyrrolo, pyrazolo, oxazolo,
thiazolo, pyridino, pyridazino, pyrimidino, pyrazino, triazino, indolo, quinolino, quinazolino or C1 to C6 alkyl-,
C1 to C6 substituted alkyl-substituted or benzo-condensed derivatives thereof

[0018] In a preferred embodiment of the invention according to formula (III)

R1 and R3 are preferably H,

R2 and R4 are preferably H, or a halogen such as iodo, bromo, chloro or fluoro or hydroxy methyl, trifluoromethyl,
methoxy, amino, alkylated amino, or nitro,

R5 is H, C1 to C8 alkyl, preferably isopropyl, vinyl, 2-isobutyl, or cyclohexyl, C1 to C8 substituted alkyl,
C3 to C8 cycloalkyl, C3 to C8 substituted cycloalkyl, aryl, substituted aryl, C1 to C6 alkylphenyl or
C7 to C12 substituted phenylalkyl,

R8 and R9 in formula (III) are taken together with the N in formula (III) to form a ring of the following types:

C3 to C8 mono-, di- or tri-heteroatom-substituted cycloalkyl such as but not limited to piperidino, morpholino,
thiomorpholino, piperazino, or C1 to C6 alkyl-, C1 to C6 substituted alkyl-substituted or benzo-condensed
derivatives thereof, preferably a 4-aryl- or 4-heteroaryl- or 4-alkyl-substituted piperazino or a
4-alkyl-substituted piperidino

C4 to C10 mono-, di- or tri-heteroatom-substituted bi- or tricycloalkyl or C1 to C6 alkyl-, C1 to C6 substituted
alkyl-substituted or benzo-condensed derivatives thereof

C3 to C8 mono-, di- or tri-heteroatom-substituted heteroaryl such as but not limited to pyrrolo, pyrazolo, oxazolo,
thiazolo, pyridino, pyridazino, pyrimidino, pyrazino, triazino, indolo, quinolino, quinazolino or C1 to C6 alkyl-,
C1 to C6 substituted alkyl-substituted or benzo-condensed derivatives thereof

[0019] In another preferred embodiment of the invention according to formula (III)

R1 and R3 are preferably H,

R2 and R4 are preferably H, or a halogen such as iodo, bromo, chloro or fluoro or hydroxy methyl, trifluoromethyl,
methoxy, amino, alkylated amino, or nitro

R5 is H, C1 to C8 alkyl, preferably isopropyl, vinyl, 2-isobutyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl,
C3 to C8 substituted cycloalkyl, aryl, substituted aryl, C1 to C6 alkylphenyl or C7 to C12 substituted phenylalkyl,

R8 is a C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to C8 substituted cycloalkyl, C1 to C6
alkylphenyl or C7 to C12 substituted phenylalkyl, preferably benzyl or (furan-2-yl)-methyl or (thiophen-2-yl)-methyl

and R9 in formula (III) is C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to C8 substituted cycloalkyl,
C1 to C6 alkylphenyl or C7 to C12 substituted phenylalkyl, preferably, however, (ethyl)-propion-1-yl

[0020] The symbol "H" denotes a hydrogen atom. 
[0021] The term "C1 to C7 acyl" encompasses groups such as formyl, acetyl, propionyl, butyryl, pentanoyl, pivaloyl,
hexanoyl, heptanoyl, benzoyl and the like. Preferred acyl groups are acetyl and benzoyl.

[0022] The term "C1 to C7 substituted acyl" denotes the acyl group substituted by one or more, and preferably one
or two, halogen, hydroxy, protected hydroxy, oxo, protected oxo, cyclohexyl, naphthyl, amino, protected amino,
(monosubstituted)amino, protected (monosubstituted)amino, (disubstituted)amino, guanidino, heterocyclic ring, substituted
heterocyclic ring, imidazolyl, indolyl, pyrrolidinyl, C1 to C7 alkoxy, C1 to C7 acyl, C1 to C7 acyloxy, nitro, C1 to C6 alkyl
ester, carboxy, protected carboxy, carbamoyl, carboxamide, protected carboxamide, N-(C1 to C6 alkyl)carboxamide,
protected N-(C1 to C6 alkyl)carboxamide, N,N-di(C1 to C6 alkyl)carboxamide, cyano, methylsulfonylamino, thiol, C1 to
C4 alkylthio or C1 to C4 alkylsulfonyl groups. The substituted acyl groups may be substituted once or more, and
preferably once or twice, with the same or with different substituents.

[0023] The term "substituted phenyl" specifies a phenyl group substituted with one or more, and preferably one or
two, moieties chosen from the groups consisting of halogen, hydroxy, protected hydroxy, cyano, nitro, C1 to C6 alkyl,
C1 to C6 substituted alkyl, C1 to C7 alkoxy, C1 to C7 substituted alkoxy, C1 to C7 acyl, C1 to C7 substituted acyl, C1 to
C7 acyloxy, carboxy, protected carboxy, carboxymethyl, protected carboxymethyl, hydroxymethyl, protected
hydroxymethyl, amino, protected amino, (monosubstituted)amino, protected (monosubstituted)amino, (disubstituted)amino,
carboxamide, protected carboxamide, N-(C1 to C6 alkyl)carboxamide, protected N-(C1 to C6 alkyl)carboxamide,
N,N-di(C1 to C6 alkyl)carboxamide, trifluoromethyl, N-((C1 to C6 alkyl)sulfonyl)amino, N-(phenylsulfonyl)amino or
phenyl, wherein the phenyl is substituted or unsubstituted, such that, for example, a biphenyl results.
[0024] Examples of the term "substituted phenyl" include a mono- or di(halo)phenyl group such as 2, 3 or
4-chlorophenyl, 2,6-dichlorophenyl, 2,5-dichlorophenyl, 3,4-dichlorophenyl, 2, 3 or 4-bromophenyl, 3,4-dibromophenyl,
3-chloro-4-fluorophenyl, 2, 3 or 4-fluorophenyl and the like; a mono- or di(hydroxy)phenyl group such as 2, 3 or
4-hydroxyphenyl, 2,4-dihydroxyphenyl, the protected-hydroxy derivatives thereof and the like; a nitrophenyl group such as
2, 3 or 4-nitrophenyl; a cyanophenyl group, for example, 2, 3 or 4-cyanophenyl; a mono- or di(alkyl)phenyl group such
as 2, 3 or 4-methylphenyl, 2,4-dimethylphenyl, 2, 3 or 4-(iso-propyl)phenyl, 2, 3 or 4-ethylphenyl, 2, 3 or
4-(n-propyl)phenyl and the like; a mono- or di(alkoxy)phenyl group, for example, 2,6-dimethoxyphenyl, 2, 3 or 4-methoxyphenyl,
2, 3 or 4-ethoxyphenyl, 2, 3 or 4-(isopropoxy)phenyl, 2, 3 or 4-(t-butoxy)phenyl, 3-ethoxy-4-methoxyphenyl and the
like; 2, 3 or 4-trifluoromethylphenyl; a mono- or dicarboxyphenyl or (protected carboxy)phenyl group such as 2, 3 or
4-carboxyphenyl or 2,4-di(protected carboxy)phenyl; a mono- or di(hydroxymethyl)phenyl or (protected
hydroxymethyl)phenyl such as 2, 3 or 4-(protected hydroxymethyl)phenyl or 3,4-di(hydroxymethyl)phenyl; a mono- or
di(aminomethyl)phenyl or (protected aminomethyl)phenyl such as 2, 3 or 4-(aminomethyl)phenyl or 2,4-(protected
aminomethyl)phenyl; or a mono- or di(N-(methylsulfonylamino))phenyl such as 2, 3 or 4-(N-(methylsulfonylamino))phenyl.
Also, the term "substituted phenyl" represents disubstituted phenyl groups wherein the substituents are different, for
example, 3-methyl-4-hydroxyphenyl, 3-chloro-4-hydroxyphenyl, 2-methoxy-4-bromophenyl, 4-ethyl-2-hydroxyphenyl,
3-hydroxy-4-nitrophenyl, 2-hydroxy-4-chlorophenyl and the like.

[0025] The term "heteroaryl" means a heterocyclic aromatic derivative which is a five-membered or six-membered 
ring system having from 1 to 4 heteroatoms, such as oxygen, sulfur and/or nitrogen, in particular nitrogen, either alone 
or in conjunction with sulfur or oxygen ring atoms. Examples of heteroaryls include pyridinyl, pyrimidinyl, pyrazinyl,
pyridazinyl, pyrrolo, furano, thiopheno, oxazolo, isoxazolo, phthalimido, thiazolo and the like. 

[0026] The term "substituted heteroaryl" means the above-described heteroaryl is substituted with, for example, one
or more, and preferably one or two, substituents which are the same or different, which substituents can be halogen,
hydroxy, protected hydroxy, cyano, nitro, C1 to C6 alkyl, C1 to C7 alkoxy, C1 to C7 substituted alkoxy, C1 to C7 acyl, C1
to C7 substituted acyl, C1 to C7 acyloxy, carboxy, protected carboxy, carboxymethyl, protected carboxymethyl,
hydroxymethyl, protected hydroxymethyl, amino, protected amino, (monosubstituted)amino, protected
(monosubstituted)amino, (disubstituted)amino, carboxamide, protected carboxamide, N-(C1 to C6 alkyl)carboxamide, protected
N-(C1 to C6 alkyl)carboxamide, N,N-di(C1 to C6 alkyl)carboxamide, trifluoromethyl, N-((C1 to C6 alkyl)sulfonyl)amino or
N-(phenylsulfonyl)amino groups.

[0027] The term "substituted naphthyl" specifies a naphthyl group substituted with one or more, and preferably one
or two, moieties either on the same ring or on different rings chosen from the groups consisting of halogen, hydroxy,
protected hydroxy, cyano, nitro, C1 to C6 alkyl, C1 to C7 alkoxy, C1 to C7 acyl, C1 to C7 acyloxy, carboxy, protected
carboxy, carboxymethyl, protected carboxymethyl, hydroxymethyl, protected hydroxymethyl, amino, protected amino,
(monosubstituted)amino, protected (monosubstituted)amino, (disubstituted)amino, carboxamide, protected
carboxamide, N-(C1 to C6 alkyl)carboxamide, protected N-(C1 to C6 alkyl)carboxamide, N,N-di(C1 to C6 alkyl)carboxamide,
trifluoromethyl, N-((C1 to C6 alkyl)sulfonyl)amino or N-(phenylsulfonyl)amino.

[0028] Examples of the term "substituted naphthyl" include a mono- or di(halo)naphthyl group such as 1, 2, 3, 4, 5,
6, 7 or 8-chloronaphthyl, 2,6-dichloronaphthyl, 2,5-dichloronaphthyl, 3,4-dichloronaphthyl, 1, 2, 3, 4, 5, 6, 7 or
8-bromonaphthyl, 3,4-dibromonaphthyl, 3-chloro-4-fluoronaphthyl, 1, 2, 3, 4, 5, 6, 7 or 8-fluoronaphthyl and the like; a mono-
or di(hydroxy)naphthyl group such as 1, 2, 3, 4, 5, 6, 7 or 8-hydroxynaphthyl, 2,4-dihydroxynaphthyl, the
protected-hydroxy derivatives thereof and the like; a nitronaphthyl group such as 3- or 4-nitronaphthyl; a cyanonaphthyl group,
for example, 1, 2, 3, 4, 5, 6, 7 or 8-cyanonaphthyl; a mono- or di(alkyl)naphthyl group such as 2, 3, 4, 5, 6, 7 or
8-methylnaphthyl, 1, 2, 4-dimethylnaphthyl, 1, 2, 3, 4, 5, 6, 7 or 8-(isopropyl)naphthyl, 1, 2, 3, 4, 5, 6, 7 or 8-ethylnaphthyl,
1, 2, 3, 4, 5, 6, 7 or 8-(n-propyl)naphthyl and the like; a mono- or di(alkoxy)naphthyl group, for example,
2,6-dimethoxynaphthyl, 1, 2, 3, 4, 5, 6, 7 or 8-methoxynaphthyl, 1, 2, 3, 4, 5, 6, 7 or 8-ethoxynaphthyl, 1, 2, 3, 4, 5, 6, 7 or
8-(isopropoxy)naphthyl, 1, 2, 3, 4, 5, 6, 7 or 8-(t-butoxy)naphthyl, 3-ethoxy-4-methoxynaphthyl and the like; 1, 2, 3, 4,
5, 6, 7 or 8-trifluoromethylnaphthyl; a mono- or dicarboxynaphthyl or (protected carboxy)naphthyl group such as 1, 2,
3, 4, 5, 6, 7 or 8-carboxynaphthyl or 2,4-di(protected carboxy)naphthyl; a mono- or di(hydroxymethyl)naphthyl or
(protected hydroxymethyl)naphthyl such as 1, 2, 3, 4, 5, 6, 7 or 8-(protected hydroxymethyl)naphthyl or
3,4-di(hydroxymethyl)naphthyl; a mono- or di(amino)naphthyl or (protected amino)naphthyl such as 1, 2, 3, 4, 5, 6, 7 or
8-(amino)naphthyl or 2,4-(protected amino)naphthyl; a mono- or di(aminomethyl)naphthyl or (protected aminomethyl)naphthyl
such as 2, 3 or 4-(aminomethyl)naphthyl or 2,4-(protected aminomethyl)naphthyl; or a mono- or
di(N-methylsulfonylamino)naphthyl such as 1, 2, 3, 4, 5, 6, 7 or 8-(N-methylsulfonylamino)naphthyl. Also, the term "substituted naphthyl"
represents disubstituted naphthyl groups wherein the substituents are different, for example,
3-methyl-4-hydroxynaphth-1-yl, 3-chloro-4-hydroxynaphth-2-yl, 2-methoxy-4-bromonaphth-1-yl, 4-ethyl-2-hydroxynaphth-1-yl,
3-hydroxy-4-nitronaphth-2-yl, 2-hydroxy-4-chloronaphth-1-yl, 2-methoxy-7-bromonaphth-1-yl, 4-ethyl-5-hydroxynaphth-2-yl,
3-hydroxy-8-nitronaphth-2-yl, 2-hydroxy-5-chloronaphth-1-yl and the like.

[0029] The term "C1 to C8 alkyl" denotes such radicals as methyl, ethyl, n-propyl, isopropyl, n-butyl, iso-butyl,
sec-butyl, tert-butyl, amyl, tert-amyl, hexyl, n-heptyl, 2-heptyl, 3-heptyl, 4-heptyl, 2-methyl-1-hexyl, 2-methyl-2-hexyl,
2-methyl-3-hexyl, n-octyl and the like.

[0030] The term "C2 to C6 alkenyl" denotes such radicals as propenyl or butenyl.

[0031] Examples of the above substituted alkyl groups include the 2-oxo-prop-1-yl, 3-oxo-but-1-yl, cyanomethyl,
nitromethyl, chloromethyl, hydroxymethyl, tetrahydropyranyloxymethyl, trityloxymethyl, propionyloxymethyl, amino,
methylamino, aminomethyl, dimethylamino, carboxymethyl, allyloxycarbonylmethyl, allyloxycarbonylaminomethyl,
methoxymethyl, ethoxymethyl, t-butoxymethyl, acetoxymethyl, chloromethyl, bromomethyl, iodomethyl,
trifluoromethyl, 6-hydroxyhexyl, 2,4-dichloro(n-butyl), 2-aminopropyl, 1-chloroethyl, 2-chloroethyl, 1-bromoethyl, 2-chloroethyl,
1-fluoroethyl, 2-fluoroethyl, 1-iodoethyl, 2-iodoethyl, 1-chloropropyl, 2-chloropropyl, 3-chloropropyl, 1-bromopropyl,
2-bromopropyl, 3-bromopropyl, 1-fluoropropyl, 2-fluoropropyl, 3-fluoropropyl, 1-iodopropyl, 2-iodopropyl, 3-iodopropyl,
2-aminoethyl, 1-aminoethyl, N-benzoyl-2-aminoethyl, N-acetyl-2-aminoethyl, N-benzoyl-1-aminoethyl,
N-acetyl-1-aminoethyl and the like.

[0032] The term "C1 to C8 substituted alkyl" denotes that the above C1 to C8 alkyl groups are substituted by one or
more, and preferably one or two, halogen, hydroxy, protected hydroxy, oxo, protected oxo, C3 to C7 cycloalkyl, naphthyl,
amino, protected amino, (monosubstituted)amino, protected (monosubstituted)amino, (disubstituted)amino,
guanidino, protected guanidino, heterocyclic ring, substituted heterocyclic ring, imidazolyl, indolyl, pyrrolidinyl, C1 to C7 alkoxy,
C1 to C7 acyl, C1 to C7 acyloxy, nitro, carboxy, protected carboxy, carbamoyl, carboxamide, protected carboxamide,
N-(C1 to C6 alkyl)carboxamide, protected N-(C1 to C6 alkyl)carboxamide, N,N-di(C1 to C6 alkyl)carboxamide, cyano,
methylsulfonylamino, thiol, C1 to C4 alkylthio or C1 to C4 alkylsulfonyl groups. The substituted alkyl groups may be
substituted once or more, and preferably once or twice, with the same or with different substituents.
[0033] The term "C7 to C12 phenylalkyl" denotes a C1 to C6 alkyl group substituted at any position by a phenyl,
substituted phenyl, heteroaryl or substituted heteroaryl. Examples of such a group include benzyl, 2-phenylethyl,
3-phenyl(n-propyl), 4-phenylhexyl, 3-phenyl(n-amyl), 3-phenyl(sec-butyl) and the like. Preferred C7 to C12 phenylalkyl groups
are the benzyl and the phenylethyl groups.
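
The carbon range in the term defined above can be read as the six ring carbons of a phenyl group added to the carbons of the C1 to C6 alkyl chain. The following is a minimal Python sketch of that arithmetic only. It assumes an unsubstituted phenyl ring (substituted phenyl or heteroaryl groups would give other counts), and the function name `phenylalkyl_carbon_count` is hypothetical and not part of the specification.

```python
PHENYL_RING_CARBONS = 6  # carbons in an unsubstituted phenyl ring (assumption of this sketch)


def phenylalkyl_carbon_count(alkyl_carbons: int) -> int:
    """Return the total carbon count of a phenylalkyl group.

    alkyl_carbons is the length of the alkyl portion, which the
    definition above restricts to C1 to C6.
    """
    if not 1 <= alkyl_carbons <= 6:
        raise ValueError("alkyl portion must be C1 to C6")
    return alkyl_carbons + PHENYL_RING_CARBONS


# benzyl has a C1 alkyl portion, 2-phenylethyl a C2 portion,
# and 4-phenylhexyl a C6 portion.
for name, chain in [("benzyl", 1), ("2-phenylethyl", 2), ("4-phenylhexyl", 6)]:
    print(name, phenylalkyl_carbon_count(chain))
```

Under these assumptions the shortest chain (benzyl) gives 7 carbons and the longest (a hexyl chain) gives 12, which matches the C7 to C12 range in the term.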

The term "C7 to C12 substituted phenylalkyl" denotes a C7 to C12 phenylalkyl group substituted on the C1 to C6 alkyl
portion with one or more, and preferably one or two, groups chosen from halogen, hydroxy, protected hydroxy, oxo,
protected oxo, amino, protected amino, (monosubstituted)amino, protected (monosubstituted)amino, (disubstituted)amino,
guanidino, protected guanidino, heterocyclic ring, substituted heterocyclic ring, C1 to C6 alkyl, C1 to C6
substituted alkyl, C1 to C7 alkoxy, C1 to C7 substituted alkoxy, C1 to C7 acyl, C1 to C7 substituted acyl, C1 to C7 acyloxy, nitro,
carboxy, protected carboxy, carbamoyl, carboxamide, protected carboxamide, N-(C1 to C6 alkyl)carboxamide,
protected N-(C1 to C6 alkyl)carboxamide, N,N-(C1 to C6 dialkyl)carboxamide, cyano, N-(C1 to C6 alkylsulfonyl)amino, thiol,
C1 to C4 alkylthio, C1 to C4 alkylsulfonyl groups; and/or the phenyl group may be substituted with one or more, and
preferably one or two, substituents chosen from halogen, hydroxy, protected hydroxy, cyano, nitro, C1 to C6 alkyl, C1
to C6 substituted alkyl, C1 to C7 alkoxy, C1 to C7 substituted alkoxy, C1 to C7 acyl, C1 to C7 substituted acyl, C1 to C7
acyloxy, carboxy, protected carboxy, carboxymethyl, protected carboxymethyl, hydroxymethyl, protected
hydroxymethyl, amino, protected amino, (monosubstituted)amino, protected (monosubstituted)amino, (disubstituted)amino,
carboxamide, protected carboxamide, N-(C1 to C6 alkyl)carboxamide, protected N-(C1 to C6 alkyl)carboxamide,
N,N-di(C1 to C6 alkyl)carboxamide, trifluoromethyl, N-((C1 to C6 alkyl)sulfonyl)amino, N-(phenylsulfonyl)amino, cyclic C2 to
C7 alkylene or a phenyl group, substituted or unsubstituted, for a resulting biphenyl group. The substituted alkyl or
phenyl groups may be substituted with one or more, and preferably one or two, substituents which can be the same
or different.

[0034] Examples of the term "C7 to C12 substituted phenylalkyl" include groups such as 2-phenyl-1-chloroethyl,
2-(4-methoxyphenyl)ethyl, 4-(2,6-dihydroxyphenyl)n-hexyl, 2-(5-cyano-3-methoxyphenyl)n-pentyl,
3-(2,6-dimethylphenyl)n-propyl, 4-chloro-3-aminobenzyl, 6-(4-methoxyphenyl)-3-carboxy(n-hexyl),
5-(4-aminomethylphenyl)-3-(aminomethyl)n-pentyl, 5-phenyl-3-oxo-n-pent-1-yl and the like.

[0035] The term "heterocycle" or "heterocyclic ring" denotes optionally substituted five-membered to eight-membered 
rings that have 1 to 4 heteroatoms, such as oxygen, sulfur and/or nitrogen, in particular nitrogen, either alone or in 
conjunction with sulfur or oxygen ring atoms. These five-membered to eight-membered rings may be saturated, fully 
unsaturated or partially unsaturated, with fully saturated rings being preferred. Preferred heterocyclic rings include 
morpholino, piperidinyl, piperazinyl, 2-amino-imidazoyl, tetrahydrofurano, pyrrolo, tetrahydrothiophen-yl,
hexylmethyleneimino and heptylmethyleneimino.

[0036] The term "substituted heterocycle" or "substituted heterocyclic ring" means the above-described heterocyclic 
ring is substituted with, for example, one or more, and preferably one or two, substituents which are the same or 
different which substituents can be halogen, hydroxy, protected hydroxy, cyano, nitro, C 1 to C 12 alkyl, C 1 to C 12 alkoxy, 
C 1 to C 12 substituted alkoxy, C 1 to C 12 acyl, C 1 to C 12 acyloxy, carboxy, protected carboxy, carboxymethyl, protected 
carboxymethyl, hydroxymethyl, protected hydroxymethyl, amino, protected amino, (monosubstituted)amino, protected 
(monosubstituted)amino, (disubstituted)amino, carboxamide, protected carboxamide, N-(C 1 to C 12 alkyl)carboxamide,
protected N-(C 1 to C 12 alkyl)carboxamide, N, N-di(C 1 to C 12 alkyl)carboxamide, trifluoromethyl, N-((C 1 to C 12 alkyl) 
sulfonyl)amino, N-(phenylsulfonyl)amino, heterocycle or substituted heterocycle groups. 

[0037] The term "C 1 to C 8 alkoxy" as used herein denotes groups such as methoxy, ethoxy, n-propoxy, isopropoxy, 
n-butoxy, t-butoxy and like groups. A preferred alkoxy is methoxy. The term "C 1 to C 8 substituted alkoxy" means the 
alkyl portion of the alkoxy can be substituted in the same manner as in relation to C 1 to C 8 substituted alkyl. 
[0038] The term "C 1 to C 8 aminoacyl" encompasses groups such as formyl, acetyl, propionyl, butyryl, pentanoyl, 
pivaloyl, hexanoyl, heptanoyl, octanoyl, benzoyl and the like. 

[0039] The term "C 1 to C 8 substituted aminoacyl" denotes the acyl group substituted by one or more, and preferably 
one or two, halogen, hydroxy, protected hydroxy, oxo, protected oxo, cyclohexyl, naphthyl, amino, protected amino, 
(monosubstituted)amino, protected (monosubstituted)amino, (disubstituted)amino, guanidino, heterocyclic ring, sub- 
stituted heterocyclic ring, imidazolyl, indolyl, pyrrolidinyl, C 1 to C 12 alkoxy, C 1 to C 12 acyl, C 1 to C 12 acyloxy, nitro, C 1 
to C 12 alkyl ester, carboxy, protected carboxy, carbamoyl, carboxamide, protected carboxamide, N-(C 1 to C 12 alkyl) 
carboxamide, protected N-(C 1 to C 12 alkyl)carboxamide, N,N-di(C 1 to C 12 alkyl)carboxamide, cyano, methylsulfo- 
nylamino, thiol, C 1 to C 10 alkylthio or C 1 to C 10 alkylsulfonyl groups. The substituted acyl groups may be substituted 
once or more, and preferably once or twice, with the same or with different substituents. 

[0040] Examples of C 1 to C 8 substituted acyl groups include 4-phenylbutyroyl, 3-phenylbutyroyl, 3-phenylpropanoyl, 
2-cyclohexanylacetyl, cyclohexanecarbonyl, 2-furanoyl and 3-dimethylaminobenzoyl.

[0041] This invention also provides a pharmaceutical composition comprising an effective amount of a compound 
according to the invention. Such pharmaceutical compositions can be administered by various routes, for example
oral, subcutaneous, intramuscular, intravenous or intracerebral. The preferred route of administration would be oral at
daily doses of the compound for adult human treatment of about 0.01-5000 mg, preferably 1-1500 mg per day. The
appropriate dose may be administered in a single dose or as divided doses presented at appropriate intervals, for
example as two, three, four or more subdoses per day.

[0042] For preparing pharmaceutical compositions containing compounds of the invention, inert, pharmaceutically 
acceptable carriers are used. The pharmaceutical carrier can be either solid or liquid. Solid form preparations include, 
for example, powders, tablets, dispersible granules, capsules, cachets, and suppositories. 

[0043] A solid carrier can be one or more substances which can also act as diluents, flavoring agents, solubilizers, 
lubricants, suspending agents, binders, or tablet disintegrating agents; it can also be an encapsulating material. 
[0044] In powders, the carrier is generally a finely divided solid which is in a mixture with the finely divided active 
component. In tablets, the active compound is mixed with the carrier having the necessary binding properties in suitable 
proportions and compacted in the shape and size desired. 

[0045] For preparing a pharmaceutical composition in the form of suppositories, a low-melting wax such as a mixture
of fatty acid glycerides and cocoa butter is first melted and the active ingredient is dispersed therein by, for example, 
stirring. The molten homogeneous mixture is then poured into convenient-sized molds and allowed to cool and solidify. 
[0046] Powders and tablets preferably contain between about 5% and about 70% by weight of the active ingredient,
preferably comprising (especially consisting of) one or more of the compounds according to this invention. Suitable 
carriers include, for example, magnesium carbonate, magnesium stearate, talc, lactose, sugar, pectin, dextrin, starch, 
tragacanth, methyl cellulose, sodium carboxymethyl cellulose, a low-melting wax, cocoa butter and the like. 
[0047] The pharmaceutical compositions can include the formulation of the active compound with encapsulating 
material as a carrier providing a capsule in which the active component (with or without other carriers) is surrounded 
by a carrier, which is thus in association with it. In a similar manner, cachets are also included. Tablets, powders, 
cachets, and capsules can be used as solid dosage forms suitable for oral administration. 

[0048] Liquid pharmaceutical compositions include, for example, solutions suitable for oral or parenteral administra- 
tion, or suspensions, and emulsions suitable for oral administration. Sterile water solutions of the active component or 
sterile solutions of the active component in solvents comprising water, ethanol, or propylene glycol are examples of 
liquid compositions suitable for parenteral administration. 

[0049] Sterile solutions can be prepared by dissolving the active component in the desired solvent system, and then 
passing the resulting solution through a membrane filter to sterilize it or, alternatively, by dissolving the sterile compound 
in a previously sterilized solvent under sterile conditions. 

[0050] In a preferred embodiment of the invention in the compounds claimed, or the pharmaceutically acceptable salts
or solvates thereof, R 1, R 3 and R 4 are H, R 2 is halogen, with iodine preferred over bromine and chlorine, and R 5 is H,
C 1 to C 8 alkyl or C 1 to C 8 substituted alkyl.

[0051] For the six preferred compounds shown in formulae (XIII), (XII), (X), (IX), (VII) and (IV) below, it has been
demonstrated that these compounds may act as full or partial agonists of LXR.

Formula (XIII); MOLNAME LN7181
Formula (XII); MOLNAME LN7179
Formula (X); MOLNAME LN7172
Formula (IX); MOLNAME LN6672
Formula (VII); MOLNAME LN7031
Formula (IV); MOLNAME LN7033

[0052] The compound according to formula (IV) (MOLNAME LN7033) has also been found to be active as a
partial agonist of the human LXR nuclear receptors, with a selective upregulation of the target genes ABCA1 and ABCG1
in THP-1 cells compared to FAS and SREBP-1c in HepG2 cells (see also Fig. 8A and 8B).
[0053] Furthermore, the present invention includes compounds of the following formula: 
Formula (V); MOLNAME LN7025
Formula (VI); MOLNAME LN7184
Formula (VIII); MOLNAME LN7032

[0054] In particular the invention relates to a compound as described above wherein said compound is capable of
binding the LXR receptor protein or a portion thereof encoded by a nucleic acid according to SEQ ID NO:1 or NO:2
(Fig. 3) or a mammalian homologue thereof. This compound can bind to the LXR receptor protein or a portion thereof
in a mixture comprising 10-200 ng of LXR receptor protein, a fusion protein containing LXR or a portion thereof,
preferably the ligand binding domain, fused to a Tag, 5-100 mM Tris/HCl at pH 6,8-8,3; 60-1000 mM KCl; 0-20 mM MgCl2;
100-1000 ng/µl BSA (in a total volume of preferably about 25 µl).
[0055] A mammalian receptor protein homologue of the protein encoded by a nucleic acid according to SEQ ID NO:
1 or 2, as used herein is a protein that performs substantially the same task as LXR does in humans and shares at
least 40% sequence identity at the amino acid level, preferably over 50 % sequence identity at the amino acid level,
more preferably over 65 % sequence identity at the amino acid level, even more preferably over 75 % sequence identity
at the amino acid level and most preferably over 85 % sequence identity at the amino acid level.
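
The identity thresholds above can be illustrated with a minimal sketch. It assumes a pairwise alignment has already been computed elsewhere, and it assumes identity is counted over aligned columns in which neither sequence has a gap; the text does not specify the denominator, so this convention is an assumption, and the function names and example peptides are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns where neither sequence has a gap ('-').

    Assumption: gap columns are excluded from the denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)


def homologue_tier(identity_pct: float):
    """Map an identity value to the preference tiers of paragraph [0055].

    'At least 40%' is inclusive; the higher tiers are worded 'over X %' and are strict.
    Returns None below 40%.
    """
    if identity_pct > 85:
        return "most preferred"
    if identity_pct > 75:
        return "even more preferred"
    if identity_pct > 65:
        return "more preferred"
    if identity_pct > 50:
        return "preferred"
    if identity_pct >= 40:
        return "minimum"
    return None


# Hypothetical aligned peptides, for illustration only
identity = percent_identity("MKTAYIAK", "MKTAYLAR")
print(identity, homologue_tier(identity))
```

Because the upper tiers are strict inequalities, a value of exactly 75 % falls into the "more preferred" tier rather than the "even more preferred" one.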
[0056] The invention in particular concerns a method for prevention or treatment of a LXR receptor protein or LXR
receptor protein homologue mediated disease or condition in a mammal comprising administration of a therapeutically 
effective amount of a compound according to the invention wherein the prevention or treatment is directly or indirectly 
accomplished through the binding of a compound according to the invention to the LXR receptor protein or to the LXR 
receptor protein homologue. 

[0057] The term "mediated" herein means that the physiological pathway in which the LXR receptor protein acts is
either directly or indirectly involved in the disease or condition to be treated or prevented. Where it is indirectly
involved, it could be that, e.g., modulating the activity of LXR by a compound according to the invention influences a
parameter which has a beneficial effect on a disease or a condition. One such example is that modulation of LXR
activity leads to decreased levels of serum cholesterol or certain lipoproteins, which in turn have a beneficial effect on
the prevention and treatment of atherosclerosis. Herein a condition is a physiological or phenotypic state which is
desirably altered. One such example would be obesity, which is not necessarily medically harmful but is nonetheless an
undesirable phenotypic condition. In a preferred embodiment of the invention the method for prevention or treatment
of a LXR receptor protein mediated disease or condition is applied to a human. This may be male or female.
[0058] Pharmaceutical compositions generally are administered in an amount effective for treatment or prophylaxis
of a specific condition or conditions. Initial dosing in humans is accompanied by clinical monitoring of the
symptoms of the selected condition. In general, the compositions are administered in an amount of active agent of at
least about 100 µg/kg body weight. In most cases they will be administered in one or more doses in an amount not in
excess of about 20 mg/kg body weight per day. Preferably, in most cases, the dose is from about 100 µg/kg to about 5
mg/kg body weight, daily.

[0059] For administration particularly to mammals, and particularly humans, it is expected that the daily dosage level 
of active agent will be 0,1 mg/kg to 10 mg/kg and typically around 1 mg/kg. 
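
As a worked illustration of the per-kilogram dosage levels in paragraph [0059], the following sketch converts them into absolute daily amounts and splits them into equal subdoses. The 70 kg body weight and the function names are assumptions chosen for illustration; they are not taken from the specification.

```python
def daily_dose_mg(body_weight_kg: float, dose_mg_per_kg: float) -> float:
    """Absolute daily dose in mg for a given body weight and per-kg dosage level."""
    return body_weight_kg * dose_mg_per_kg


def split_into_subdoses(total_mg: float, n_subdoses: int) -> list:
    """Divide a daily dose into n equal subdoses (e.g. two, three or four per day)."""
    if n_subdoses < 1:
        raise ValueError("at least one subdose is required")
    return [total_mg / n_subdoses] * n_subdoses


BODY_WEIGHT_KG = 70.0  # assumed adult body weight, illustration only

# Lower bound, typical value and upper bound from paragraph [0059]
for level in (0.1, 1.0, 10.0):
    total = daily_dose_mg(BODY_WEIGHT_KG, level)
    print(f"{level} mg/kg -> {total:.1f} mg/day, "
          f"three subdoses of {split_into_subdoses(total, 3)[0]:.1f} mg")
```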

[0060] By "therapeutically effective amount" is meant a symptom-alleviating or symptom-reducing amount, a
cholesterol-reducing amount, a cholesterol absorption blocking amount, a protein and/or carbohydrate digestion-blocking
amount and/or a de novo cholesterol biosynthesis-blocking amount of a compound according to the invention.
[0061] Likewise the invention concerns a method of treating in a mammal a disease which is correlated with abnormal
cholesterol, triglyceride, or bile acid levels or deposits comprising administering to a mammal in need of such treatment
a therapeutically effective amount of a compound according to the invention.
[0062] Accordingly, the compounds according to the invention may also be used in a method of prevention or treatment
of mammalian atherosclerosis, gallstone disease, lipid disorders, Alzheimer's disease, skin disorders, inflammation,
obesity or cardiovascular disorders such as coronary heart disease or stroke.

[0063] The invention further concerns a method of blocking cholesterol absorption in the intestine of a mammal
in need of such blocking, comprising administering to said mammal a therapeutically effective
amount of a compound according to the invention. The invention may also be used to treat obesity in humans.

[0064] The Liver X Receptor alpha is a prototypical type 2 nuclear receptor, meaning that it activates genes upon
binding to the promoter region of target genes in a heterodimeric fashion with Retinoid X Receptor. The relevant
physiological ligands of LXR are oxysterols. The compounds have been demonstrated to have a high binding efficacy
(binding coefficients measured as EC50 in the range of 1-5 µM) as well as agonistic and/or partial agonistic properties.
Consequently they may be applied to regulate genes that participate in bile acid, cholesterol and fatty acid homeostasis
as well as other downstream regulated genes. Examples of such genes include, but are not limited to, genes involved in
lipid absorption, cholesterol biosynthesis, cholesterol transport or binding, bile acid transport or binding, proteolysis,
amino acid metabolism, glucose biosynthesis, protein translation, electron transport, and hepatic fatty acid metabolism.
LXR often functions in vivo as a heterodimer with the Retinoid X Receptor. Published non-steroidal LXR agonists such as the
"Tularik" compound "T0901317" (see figure 5) are known to influence the regulation of various liver genes. Genes
found to be regulated by T0901317 can be found in figure 6. Thus, the invention also concerns a method of modulating
a gene whose expression is regulated by the LXR receptor in a mammal comprising administration of a therapeutically
effective amount of a compound according to the invention to said mammal.

[0065] A number of direct and indirect LXR target genes have been described whose regulated expression contributes
to cholesterol homeostasis and lipogenesis. In this respect the direct regulation of Cyp7A, which was shown to
be a direct target gene of LXR at least in the rodent lineage, is an important aspect of cholesterol removal by increased
metabolism of bile acids (Lehmann et al., J Biol. Chem. 272 (6) 3137-3140; 1997). Gupta et al. (Biochem. Biophys Res.
Com, 293; 338-343, 2002) showed that LXRalpha regulation of Cyp7A is dominant over FXR inhibitory effects on Cyp7A
transcription.

[0066] A key transcription factor that was also shown to be a direct target gene for the LXR receptor is SREBP-1c
(Repa et al., Genes and Development, 14:2819-2830; 2000; Yoshikawa et al.; Mol.Cell.Biol. 21 (9) 2991-3000, 2001).
SREBP-1c itself activates transcription of genes involved in cholesterol and fatty acid synthesis in liver but also other
mammalian tissues. Some of the SREBP-1c target genes involved in lipogenesis, like FAS and SCD, have been shown to be
additionally direct targets of the LXR receptors (Joseph et al.; J Biol Chem. 2002 Mar 29;277(13):11019-25; Liang et
al., J Biol Chem. 2002 Mar 15;277(11):9520-8.).

[0067] A primary limitation for the applicability of LXR agonists as e.g. anti-atherosclerotic drugs comes from the
observation that compounds with full agonistic activity, e.g. T0901317, not only elevate HDL cholesterol levels but
also increase plasma triglyceride levels in mice (Schultz et al., 2000 Genes Dev. 14:2831-8.). Concomitantly, not
only genes that are involved in cholesterol efflux, such as the cholesterol transporter ABCA1 (Venkateswaran et al.,
2000 PNAS. 97:12097-102.), ABCG1 as well as the lipid binding protein Apolipoprotein E (Laffitte et al. 2001 PNAS 98:
507-512), are induced by full LXR agonists, but also genes involved in lipogenesis, including the fatty acid synthase
FAS (Joseph et al 2002 J Biol Chem. 277:11019-11025), and SREBP-1c (Yoshikawa et al., 2001 Mol Cell Biol. 21:
2991-3000). Elevation of serum triglyceride levels is an independent risk factor for atherosclerosis (for review see Miller,
1999). Thus, LXR activity needs to be selectively modulated for therapeutic benefit. In particular, compounds need to
be found that stimulate reverse cholesterol transport, but do not significantly increase triglyceride levels.

[0068] Another gene that has been shown to be directly regulated by LXRs is the LPL gene, which codes for a key
enzyme that is responsible for the hydrolysis of triglycerides in circulating lipoprotein, releasing free fatty acids to
peripheral tissues (Zhang et al. J Biol Chem. 2001 Nov 16;276(46):43018-24). This enzyme is believed to promote
uptake of HDL cholesterol in liver, thereby promoting reverse cholesterol transport. A similar functional involvement in
HDL clearance is described for the CETP gene product, which facilitates the transfer of HDL cholesterol esters from
plasma to the liver. LXR response elements were found in the CETP promoter and direct activation of this gene by
LXR was demonstrated (Luo and Tall; J Clin Invest. 2000 Feb;105(4):513-20.).

[0069] The regulated transport of cholesterol through biological membranes is an important mechanism in order to
maintain cholesterol homeostasis. A pivotal role in these processes in multiple tissues, e.g. macrophages and
intestinal mucosa cells, is played by the ATP-binding cassette transporter proteins (ABC). ABCA1 and ABCG1
were identified as direct LXR target genes (Costet et al.; J Biol Chem. 2000 Sep 8;275(36):28240-5) that mediate
cholesterol efflux and thereby prevent e.g. generation of atherogenic plaques in macrophages (Singaraja et al. J Clin
Invest. 2002 Jul;110(1):35-42). Other ABC transporters like ABCG5 and ABCG8, primarily expressed in hepatocytes
and enterocytes, have also been reported to be directly responsive to LXR agonists (Repa et al., J Biol Chem. 2002
May 24;277(21):18793-800; Kennedy et al., J Biol Chem. 2001 Oct 19;276(42):39438-47) and mediate the secretion
of sterols from the liver and efflux of dietary sterols from the gut. Apolipoproteins E, C-I, C-II, and C-IV, which fulfill
important roles in lipoprotein/lipid homeostasis, have also been shown to be direct targets of the LXR receptor (Laffitte
et al., Proc Natl Acad Sci USA. 2001 Jan 16;98(2):507-12; Mak et al.; J Biol Chem. 2002 May 24 [epub ahead of
print]). These proteins have been found to be crucial components of chylomicrons, VLDL, IDL, and HDL and are among
other things associated with hypertriglyceridemia and arteriosclerosis.

[0070] Recently LXRalpha itself was shown to be regulated by both LXR receptors in human cell types including
macrophages, suggesting an autoregulatory amplification event in the response to LXR ligands which could e.g. lead
to an enhanced stimulation of LXR target genes such as ABCA1 (Bolten et al.; Mol Endocrinol. 2002 Mar;16(3):506-14.;
Laffitte et al., Mol Cell Biol. 2001 Nov;21(22):7558-68; Whitney et al.; J Biol Chem. 2001 Nov 23;276(47):43509-15).
[0071] Besides the important function of LXR receptors in tissues like liver and macrophages it has recently been
reported that stimulation of epidermal differentiation is mediated by Liver X receptors in murine epidermis.
Differentiation marker genes like involucrin, loricrin and profilaggrin have been shown to be upregulated upon LXR ligand
treatment (Komuves et al.; J Invest Dermatol. 2002 Jan;118(1):25-34.).

[0072] Another recent report describes the regulation of cholesterol homeostasis (primarily the regulation of ABCA1,
ABCG1 and SREBP-1c) by the LXR receptors in the central nervous system, suggesting that LXRs may prove beneficial
in the treatment of CNS diseases such as Alzheimer's and Niemann-Pick disease that are known to be accompanied
by dysregulation of cholesterol balance (Whitney et al.; Mol Endocrinol. 2002 Jun;16(6):1378-85).
[0073] Activation of LXR by an agonist improves glucose tolerance in a murine model of diet-induced obesity and 
insulin resistance. Gene expression analysis in LXR agonist-treated mice reveals coordinate regulation of genes in- 
volved in glucose metabolism in liver and adipose tissue, e.g. the down-regulation of peroxisome proliferator-activated 
receptor gamma coactivator-1 alpha (PGC-1), phosphoenolpyruvate carboxykinase (PEPCK), and glucose-6-phos- 
phatase expression and induction of glucokinase in liver. In adipose tissue, activation of LXR led to the transcriptional 
induction of the insulin-sensitive glucose transporter, GLUT4. LXR agonists may limit hepatic glucose output and improve
peripheral glucose uptake (Laffitte et al. (2003) PNAS 100:5419-24). 

[0074] Therefore another important embodiment of the invention concerns methods that enhance or suppress
the above-mentioned genes, among other as yet unknown LXR target genes, and the associated biological processes
and pathways through LXR compounds that are the subject of this invention.

[0075] The compounds according to the invention may be used as medicaments, in particular for the manufacture 
of a medicament for the prevention or treatment of a LXR receptor protein or LXR receptor protein homologue mediated 
disease or condition in a mammal wherein the prevention or treatment is directly or indirectly accomplished through 
the binding of the compound according to the invention to the LXR receptor protein or LXR receptor protein homologue. 
These pharmaceutical compositions contain 0,1 % to 99,5 % of the compound according to the invention, more par- 
ticularly 0,5 % to 90 % of the compound according to the invention in combination with a pharmaceutically acceptable 
carrier. 

[0076] The invention concerns also the use of a compound according to the invention for the manufacture of a med- 
icament for the prevention or treatment of a LXR receptor protein mediated disease or condition wherein the mammal 
described above is a human. The medicament may be used for regulating the cholesterol transport system, for regu- 
lating levels of cholesterol, triglyceride, and/or bile acid in a mammal preferentially a human by activating the LXR 
receptor. The medicament may be used for the treatment of atherosclerosis, gallstone disease, lipid disorders,
Alzheimer's disease, skin disorders, obesity or a cardiovascular disorder.

[0077] The invention further concerns the use of a compound according to the invention for the manufacture of a 
medicament capable of blocking in a mammal, preferentially in a human, the cholesterol absorption in the intestine.
Further the claimed compound may be used for the manufacture of a medicament for treating obesity in humans and 
for modulating a gene whose expression is regulated by the LXR receptor (see details above and figures). 
EXAMPLES

EXAMPLE 1 

[0078] In vitro screening for compounds which influence LXR binding to coactivators. 

[0079] For screening purposes a GST and 6 x His fusion of the LBD (amino acids 155 to 447) of
human LXRalpha is constructed by first cloning a Gateway cassette (Invitrogen) in frame into the Sma I site of the
pACGHLT polylinker (Pharmingen). Then a PCR fragment specifically amplified from human liver cDNA is cloned into
the resulting pACGHLT-GW following the manufacturer's instructions for Gateway cloning (Invitrogen) to yield
pACGHLT-GW-hLXRalphaLBD.
Primers used for amplification are:

GGGGACAAGTTTGTACAAAAAAGCAGGCTCGCTTCGCAAATGCCGTCAG 

and 

GGGGACCACTTTGTACAAGAAAGCTGGGTCCCCTTCTCAGTCTGTTCCACTT . 
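
Both primers carry a Gateway attB adapter at the 5' end, followed by the gene-specific part. The sketch below separates the two parts. The adapter strings are the commonly used attB1 and attB2 primer extensions and are stated here as an assumption rather than taken from this specification; the function name is hypothetical.

```python
# Assumed standard Gateway attB adapter sequences (5' primer extensions)
ATTB1 = "GGGGACAAGTTTGTACAAAAAAGCAGGCT"
ATTB2 = "GGGGACCACTTTGTACAAGAAAGCTGGGT"


def split_gateway_primer(primer: str):
    """Return (adapter_name, gene_specific_part) for a primer with a 5' attB adapter.

    Whitespace is removed first, so primers wrapped over several lines can be passed.
    Returns ("none", sequence) if no known adapter is found.
    """
    seq = "".join(primer.split()).upper()
    for name, adapter in (("attB1", ATTB1), ("attB2", ATTB2)):
        if seq.startswith(adapter):
            return name, seq[len(adapter):]
    return "none", seq


forward = "GGGGACAAGTTTGTACAAAAAAGCAGGCTCGCTTCGCAAATGCCGTCAG"
reverse = "GGGGACCACTTTGTACAAGAAAGCTGGGTCCCCTTCTCAGTCTGTTCCACTT"

for primer in (forward, reverse):
    adapter, specific = split_gateway_primer(primer)
    print(adapter, specific, len(specific))
```

Only the gene-specific part anneals to the LXRalpha cDNA; the adapter provides the recombination site used for Gateway cloning.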

100 % sequence integrity of all recombinant products is verified by sequencing. Recombinant Baculovirus is constructed
from pACGHLT-GW-hLXRalphaLBD using the Pharmingen Baculovirus Expression vector system according to
instructions of the manufacturer. Monolayer cultures of SF9 cells are infected by the virus as recommended by Pharmingen,
or 200 ml cultures of 1 x 10^6 cells/ml grown in 2 liter Erlenmeyer flasks on an orbital shaker at 30 rpm are infected
by 10 ml of the same virus stock. In both cases cells are harvested 3 days after infection. All cell growth is performed in
Gibco SF900 II with Glutamine (Invitrogen) medium without serum supplementation at 28°C. Since SF9 cells contain
significant amounts of endogenous GST, purification is performed via His and not via GST affinity chromatography. To
this end instructions of Pharmingen for purification of recombinant His tagged proteins from SF9 cells are followed with
the following modifications: all detergents are omitted from the buffers and cells are lysed on ice by 5 successive
sonication pulses using a sonicator needle at maximum power. All eluates are dialyzed against 20 mM Tris/HCl pH
6,8; 300 mM KCl; 5 mM MgCl2; 1 mM DTT; 0,2 mM PMSF; 10% glycerol. A typical dialyzed eluate fraction contains
the fusion protein at a purity of more than 80%. Total protein concentration is 0,1-0,3 mg/ml.

For E. coli expression of a NR coactivator, pDest17-hTif2BD expressing a NR interaction domain from amino acids
548-878 of human Tif2 (Acc. No: XM_011633 RefSeq) tagged by 6 N-terminal His residues is constructed. To this end,
a PCR fragment specifically amplified from human liver cDNA is subcloned into pDest17 (Invitrogen) following the
manufacturer's instructions for Gateway cloning (Invitrogen). Primers used for amplification are:

GGGGACAAGTTTGTACAAAAAAGCAGGCTCGTTAGGGTCATCGTTGGCTTCACC 

and 

GGGGACCACTTTGTACAAGAAAGCTGGGTCTCAAAGTTGCCCTGGTCGTGGGTTA

For E. coli expression, plasmid DNA is transformed into chemically competent E. coli BL21 (Invitrogen, USA) and cells
are grown to an OD600 of 0.4-0.7 before expression is induced by addition of 0,5 mM IPTG according to the instructions
of the manufacturer (Invitrogen). After induction for 8 hours at 30°C cells are harvested by centrifugation for 10 minutes
at 5000 x g. Fusion proteins are affinity purified using Ni-NTA Agarose (QIAGEN) according to the instructions of the
manufacturer. Recombinant Tif2 construct is dialyzed against 20 mM Tris/HCl pH 7.9; 60 mM KCl; 5 mM MgCl2; 1
mM DTT; 0,2 mM PMSF; 10% glycerol. A typical dialyzed eluate fraction contains the fusion protein at a purity of more
than 80%. Total protein concentration is 0,1-0,3 mg/ml.

The TIF2 fragment is subsequently biotinylated by addition of 5-40 µl/ml Tif2 fraction of a Biotinamidocaproate
N-Hydroxysuccinimide-ester (Sigma) solution (20 mg/ml in DMSO). Overhead rotating samples are incubated for 2 hours
at room temperature. Unincorporated label is then separated using G25 Gel filtration chromatography (Pharmacia
Biotech, Sweden). Protein containing fractions from the column are pooled and tested for activity in the assay as
described below.

EP 1 398 032 A1

[0080] For screening of compound libraries as provided for by the methods shown below in the examples for substances
which influence the LXR/Tif2 interaction, the Perkin Elmer LANCE technology is applied. This method relies
on the binding dependent energy transfer from a donor to an acceptor fluorophore attached to the binding partners of
interest. For ease of handling and reduction of background from compound fluorescence, LANCE technology makes
use of generic fluorophore labels and time-resolved detection (for a detailed description see Hemmila I, Blomberg K and
Hurskainen P, Time-resolved resonance energy transfer (TR-FRET) principle in LANCE, Abstract of Papers Presented
at the 3rd Annual Conference of the Society for Biomolecular Screening, Sep., California (1997)).
For screening, 20-200 ng of biotinylated Tif2 fragment and 10-200 ng of GST-LXR fragment are combined with 0.5-2
nM LANCE Eu-(W1024) labelled anti-GST antibody (Perkin Elmer) and 0,1-0,5 µg of highly fluorescent APC-labelled
streptavidin (Perkin Elmer, AD0059) in the presence of 50 µM of individual compounds to be screened in a total volume
of 25 µl of 20 mM Tris/HCl pH 6,8; 300 mM KCl; 5 mM MgCl2; 100-1000 ng/µl BSA. The DMSO content of the samples is
kept below 4%. Samples are incubated for a minimum of 60 minutes in the dark at room temperature in FIA-Plates
black 384well med. binding (Greiner).

[0081] The LANCE signal is detected by a Perkin Elmer VICTOR2V™ Multilabel Counter applying the detection
parameters listed in Fig. 2. The results are visualized by plotting the ratio between the emitted light at 665 nm and at
615 nm. For every batch of recombinant proteins, the amounts of proteins, including BSA, and labeling reagents giving the
most sensitive detection of hits are determined individually by analysis of dose response curves for 22(R)-hydroxycholesterol
and T0901317.
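
The ratio readout described above can be sketched as follows. This is a minimal illustration, assuming raw counts at 665 nm (acceptor) and 615 nm (donor) are available per well; the function names and all count values are hypothetical, and the signal-to-background comparison against DMSO control wells is an added convenience, not a step stated in the text.

```python
from statistics import mean


def lance_ratio(counts_665: float, counts_615: float) -> float:
    """Ratio of acceptor emission (665 nm) to donor emission (615 nm) for one well."""
    if counts_615 <= 0:
        raise ValueError("615 nm counts must be positive")
    return counts_665 / counts_615


def signal_to_background(sample_ratios, control_ratios) -> float:
    """Mean ratio of compound wells divided by mean ratio of control (DMSO only) wells."""
    return mean(sample_ratios) / mean(control_ratios)


# Hypothetical raw counts as (665 nm, 615 nm) pairs
compound_wells = [(4200, 10100), (4350, 9900), (4100, 10050)]
control_wells = [(2050, 10200), (1980, 9950), (2010, 10000)]

sample = [lance_ratio(a, d) for a, d in compound_wells]
control = [lance_ratio(a, d) for a, d in control_wells]
print(f"signal/background = {signal_to_background(sample, control):.2f}")
```

Using the ratio rather than the 665 nm counts alone normalizes each well to its own donor signal, which reduces the influence of pipetting differences between wells.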

EXAMPLE 2 

[0082] Experimental procedure for the preparation of the compounds according to the invention. 

o-AZIDOBENZOIC ACID SYNTHESIS (2)

[0083] The anthranilic acid (1, 1 eq., 0.5-1 M) is suspended in 6 M HCl, containing enough AcOH (0-20% dependent
upon the anthranilic acid) to facilitate dissolution of the anthranilic acid and/or the intermediate diazonium salt, and
cooled to 0 °C. NaNO2 (1.1 eq., 1.3-2.5 M) dissolved in H2O is added to the anthranilic acid solution at a rate such
that the temperature of the reaction solution remains below 5 °C. The resulting homogeneous solution of the diazonium
salt is slowly filtered through a sintered glass funnel into a solution of NaN3 (1.1 eq., 0.7-1.1 M) and NaOAc (12 eq.)
in H2O. The reaction mixture is stirred/shaken for 30-60 min following cessation of vigorous N2 evolution. Following
acidification of the reaction mixture to pH 1 with concentrated HCl, the mixture is cooled to 0 °C to encourage complete
precipitation of the o-azidobenzoic acid. The precipitate is collected by filtration and washed with 6 M HCl (2x) and
H2O (2x). The o-azidobenzoic acid product (2) is dried in vacuo (500 mtorr, 30°C).
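
The equivalents and concentrations given in this procedure translate into solution volumes by simple arithmetic (mmol divided by mol/l gives ml). The sketch below shows the calculation; the 10 mmol scale and the specific concentrations picked from within the stated ranges are assumptions for illustration only.

```python
def solution_volume_ml(substrate_mmol: float, equivalents: float, molarity: float) -> float:
    """Volume in ml of a reagent solution that delivers the given equivalents.

    mmol of reagent = substrate_mmol * equivalents; mmol / (mol/l) = ml.
    """
    if molarity <= 0:
        raise ValueError("molarity must be positive")
    return substrate_mmol * equivalents / molarity


SCALE_MMOL = 10.0  # assumed amount of anthranilic acid (1), illustration only

# Concentrations chosen from within the ranges stated in paragraph [0083]
acid_volume = solution_volume_ml(SCALE_MMOL, 1.0, 0.5)     # anthranilic acid at 0.5 M in 6 M HCl
nitrite_volume = solution_volume_ml(SCALE_MMOL, 1.1, 2.0)  # NaNO2, 1.1 eq. at 2.0 M
azide_volume = solution_volume_ml(SCALE_MMOL, 1.1, 1.0)    # NaN3, 1.1 eq. at 1.0 M

print(f"anthranilic acid suspension: {acid_volume:.1f} ml")
print(f"NaNO2 solution: {nitrite_volume:.1f} ml")
print(f"NaN3 solution: {azide_volume:.1f} ml")
```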

ACYLATION OF HYDROXYMETHYL RESIN (4) 

[0084] To hydroxymethyl resin (1.0 eq., 1.3 mmol/g) and the o-azidobenzoic acid (1, 2.5 eq.) is added DMF (to give
400 mM o-azidobenzoic acid, 1), Cs2CO3 (2.0 eq.) and KI (2.0 eq.). Following agitation of the reaction mixture for 36-48
h, the resin-bound o-azidobenzoic acid (4) is washed with MeOH (2 cycles), CH2Cl2 (3 cycles), MeOH (3 cycles), DMF
(3 cycles), MeOH (3 cycles) and CH2Cl2 (3 cycles), and dried in vacuo.

AZA-WITTIG FORMATION (5) 

[0085] To the resin-bound o-azidobenzoic acid (4, 1.0 eq.) is added a solution of PPh3 (THF, 500 mM, 5.0 eq.). After
6 h, the resin is washed with 3 cycles of the following: THF (3 cycles), toluene (3 cycles), CH2Cl2 (3 cycles) and hexanes
(3 cycles), followed by drying in vacuo to afford resin-bound iminophosphorane (5).

CARBODIIMIDE FORMATION (6)

[0086] To the resin-bound iminophosphorane (5, 1 eq.) is added isocyanate (9, 5 eq., 450 mM) dissolved in
ClCH2CH2Cl. The compounds are shaken at ambient temperature for 16 h, washed with 3 cycles of the following: THF
(3 cycles), toluene (3 cycles), CH2Cl2 (3 cycles) and hexanes (3 cycles), and dried in vacuo to afford carbodiimide (6).

GUANIDINE FORMATION / CYCLIZATION

[0087] To the carbodiimide functionalized resin (6) is added secondary amine (10, 0.6 eq., 500 mM) dissolved in
ClCH2CH2Cl. The reaction mixture is heated to 50 °C in an incubator for 12-72 h to afford 2-aminoquinazoline (8).
[0088] All of the final products are analyzed using Evaporative Light Scattering Detector (ELSD) detection to
determine purity.

EXAMPLE 3 

[0089] This example illustrates that compounds according to the invention (experiments shown were done with
MOLSTRUCTURE LN0000007033; see formulas (9)) activate luciferase reporter gene expression in a dose dependent
manner mediated through GAL4-LXRa-LBD or GAL4-LXRb-LBD constructs in HEK293 cells. LN0000007033
activates LXR beta LBD construct mediated luciferase activity much more strongly than the LXR alpha construct, which is in
contrast to the similar activation of both LXR alpha and LXR beta LBD containing constructs with T0901317 (see Fig. 7).
[0090] HEK293 cells are grown in 96 well plates and co-transfected with pFR-luc (Stratagene) and pCMV-BD-LXRa-LBD
or pCMV-BD-LXRb-LBD (each 100 ng of plasmid DNA per well). Transfection is carried out using Lipofectamine
2000 (Gibco-BRL) according to the manufacturer's protocol. The ligand binding domains (LBD) of LXRa and
LXRb are cloned into pCMV-BD-GW (the Gateway Reading Frame Cassette B is cloned as an EcoRV fragment
into the SmaI site of pCMV-BD) applying the manufacturer's protocols for the Gateway™ system (Invitrogen).
Luciferase reporter activity is measured in triplicate from extracts of cells after incubating cells in culture medium
(DMEM [Gibco-BRL] + 10% FCS [PAA Laboratories]) for 16 hours (5% CO2, 37°C) containing 0,5% DMSO (control) or
0,5% DMSO with increasing concentrations of LN0000007033. The type of assay used here is a mammalian one hybrid
(M1H) assay that is known to those skilled in the art.

[0091] Dose-dependent luciferase activities originating from pFR-luc demonstrate the relative activities of the com- 
pounds with the LXRa or LXRb LBDs in this mammalian one hybrid type approach. 
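
One simple way to express the dose-dependent reporter activity described above is as fold activation over the DMSO control, using the mean of each triplicate. The sketch below assumes this convention; the RLU values and the function name `fold_activation` are hypothetical and serve only to show the calculation.

```python
from statistics import mean


def fold_activation(rlu_by_conc: dict, control_conc: float = 0.0) -> dict:
    """Mean RLU of each triplicate divided by the mean RLU of the
    vehicle (DMSO) control, keyed by compound concentration in µM."""
    control_mean = mean(rlu_by_conc[control_conc])
    return {
        conc: mean(values) / control_mean
        for conc, values in sorted(rlu_by_conc.items())
    }


# Hypothetical triplicate luciferase readings (RLU); 0.0 µM is the DMSO control
example_rlu = {
    0.0: [1000, 1100, 900],
    0.1: [1500, 1450, 1550],
    1.0: [4000, 4200, 3800],
    10.0: [9000, 9500, 8500],
}

folds = fold_activation(example_rlu)
for conc, fold in folds.items():
    print(f"{conc:>5} µM: {fold:.2f}-fold")
```

Comparing such fold values between the LXRa-LBD and LXRb-LBD constructs at the same concentration gives the relative activities referred to in the text.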

EXAMPLE 4 

[0092] This example shows that the described compounds can increase the abundance of mRNA of LXR target genes
such as ABCA1 and ABCG1 in TPA-treated THP-1 cells, or FAS and SREBP-1c in HepG2 cells, as shown
in Fig. 8A and Fig. 8B.

[0093] THP-1 cells are seeded in 24-well plates at 3 x 10^5 cells per well in RPMI 1640 medium containing 10% FCS
and 100 nM TPA for 24 h. HepG2 cells are seeded in poly-L-lysine coated 24-well plates at 1 x 10^6 cells per well in
EMEM medium containing 10% FCS until they are approx. 60% confluent.

Before treatment with LXR compounds, the growth medium is changed to medium containing 10% charcoal/dextran- 
stripped FCS for 12 h. Treatment is done for 12h (THP-1 cells) and 24h (HepG2 cells), respectively, in medium con- 
taining 10% charcoal/dextran-stripped FCS (and 100 nM PMA in the case of THP-1 cells). 

LXR compounds are dissolved in DMSO, with the final solvent concentration never exceeding 0.125%. All treatments
are done in triplicate and experiments are repeated twice. Total RNA is extracted using the Qiagen RNeasy Mini Kit and
treated with DNase (DNA-free kit, Ambion). RNA is reverse transcribed with Oligo(dT) primer and real-time reverse
transcription PCR (TaqMan) is performed using the ABI Prism 7900HT Sequence Detection System and reagents 
supplied by Applied Biosystems. mRNA steady state levels are normalised to H3 histone (H3F3A) expression levels. 
The sequences of forward primers, reverse primers and TaqMan probes are as follows : 

FAS: CTGAGACGGAGGCCATATGCT, GCTGCCACACGCTCCTCTAG, FAM-CAGCAGTTCACGGACATGGAGCACAA-TAMRA

ABCA1: TCCTGTGGTGTTTCTGGATGAAC, CTTGACAACACTTAGGGCACAATTC, FAM-ACCACAGGCATGGATCCCAAAGCC-TAMRA
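
A primer and probe set of this kind can be checked against a template by searching for the forward primer and probe on the sense strand and for the reverse complement of the reverse primer. The sketch below does this for the FAS set; the 120-nt template is an assumption in the sense that it was copied by hand from the 8461-nt Homo sapiens sequence given under <400> 4 in the sequence listing of this document (rows between the running counts 7200 and 7320), and the helper names are hypothetical.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq.upper()))


# 120-nt stretch copied from the <400> 4 listing (hand-copied, see lead-in)
TEMPLATE = (
    "ggctgaggct" "gagacggagg" "ccatatgctt" "cttcgtgcag" "cagttcacgg" "acatggagca"
    "caacagggtg" "ctggaggcgc" "tgctgccgct" "gaagggccta" "gaggagcgtg" "tggcagccgc"
).upper()

FAS_FORWARD = "CTGAGACGGAGGCCATATGCT"
FAS_REVERSE = "GCTGCCACACGCTCCTCTAG"
FAS_PROBE = "CAGCAGTTCACGGACATGGAGCACAA"  # without FAM/TAMRA labels

# Forward primer and probe are searched on the sense strand as written;
# the reverse primer anneals to the opposite strand, so its reverse
# complement is what appears in the sense-strand template.
fwd_start = TEMPLATE.find(FAS_FORWARD)
probe_start = TEMPLATE.find(FAS_PROBE)
rev_start = TEMPLATE.find(reverse_complement(FAS_REVERSE))

# Amplicon runs from the first base of the forward primer to the last
# base of the reverse-primer binding site.
amplicon_length = rev_start + len(FAS_REVERSE) - fwd_start

print(fwd_start, probe_start, rev_start, amplicon_length)
```

In this fragment the forward primer, the probe and the reverse-primer site occur in that order, which is the arrangement a TaqMan assay requires.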

EXAMPLE 5 

[0094] LN0000007033 causes a marked increase in cholesterol export in differentiated THP-1 macrophages (see
Figure 9 A).

Strikingly, the compound T0901317 causes a marked increase in triglyceride mass in HepG2 liver cells, while
compounds like LN0000007033 and LN0000007025 do not cause a significant increase in triglyceride mass.
This behavior is similar to the selective transcriptional effect of compounds like LN0000007033 and LN0000007025
on the LXR target genes in HepG2 versus THP-1 cells (see Fig 8A and 8B).

[0095] Methods: Cultures of the monocyte-macrophage cell line (THP-1) and the hepatocytes (HepG2) are obtained from the
American Type Culture Collection, Rockville, MD. THP-1 cells are grown in RPMI 1640 medium supplemented with
10% fetal bovine serum, 10 mM HEPES, 2 mM pyruvate and 50 µM β-mercaptoethanol; HepG2 cells are grown in Minimum
Essential Medium (Eagle) with 2 mM L-glutamine and Earle's BSS supplemented with 10% fetal bovine serum, 2 mM glutamine,
0.1 mM non-essential amino acids and 1 mM sodium pyruvate. Both are kept at 37°C in 5% CO2.
THP-1 cells are differentiated into macrophages by addition of 100 nM phorbol 12-myristate 13-acetate (PMA; Sigma
P8139), and PMA is included in the medium of all subsequent experiments to maintain differentiation.
For cholesterol efflux measurements and triglyceride analysis, cells are seeded in 6-well plates at 1.8 x 10^6 cells
(THP-1) and 1 x 10^6 cells (HepG2) per well, respectively.

CHOLESTEROL EFFLUX 

[0096] THP-1 cells are seeded in 6-well plates at 1 x 10^6 cells per well in RPMI 1640 medium containing 10% FCS
and 100 nM TPA for 72 h. After washing with PBS, cells are incubated for 24 h with fresh RPMI 1640 medium containing
10% FCS and 100 nM TPA. Cells are washed twice with PBS, and RPMI 1640 medium containing 0.15% BSA and
100 nM TPA is added for a further 24 h. Treatment with LXR compounds is done for 24 h in RPMI 1640 medium containing
0.15% BSA, 100 nM TPA and 40 µg/ml ApoA1 (Calbiochem # 178452). Medium is collected, centrifuged to remove
cell debris and assayed for cholesterol using a commercial fluorometric kit (Molecular Probes A-12216). The remaining
cellular proteins are lysed with 0.3 N NaOH and protein content is measured with the Biorad Bradford reagent.
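
Since both the cholesterol in the medium and the protein content of the same well are measured, a natural way to compare wells is cholesterol per mg of cellular protein, expressed relative to the vehicle control. The sketch below assumes this normalisation; the description does not state the exact calculation, and all numbers and function names are hypothetical.

```python
def per_mg_protein(analyte_amount: float, protein_mg: float) -> float:
    """Normalise an analyte amount (e.g. µg cholesterol in the medium)
    to the protein content (mg) of the same well."""
    if protein_mg <= 0:
        raise ValueError("protein content must be positive")
    return analyte_amount / protein_mg


def fold_over_control(treated: float, control: float) -> float:
    """Relative fold increase of a treated well over the vehicle control."""
    return treated / control


# Hypothetical wells: µg cholesterol in the medium and mg protein per well
control_well = per_mg_protein(analyte_amount=2.0, protein_mg=0.50)
treated_well = per_mg_protein(analyte_amount=6.0, protein_mg=0.75)

relative_efflux = fold_over_control(treated_well, control_well)
print(f"relative cholesterol efflux: {relative_efflux:.2f}-fold")
```

Normalising to protein keeps differences in cell number between wells from being read as differences in efflux.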

TRIGLYCERIDE ASSAY 

[0097] HepG2 cells are seeded in poly-L-lysine coated 6-well plates at 1 x 10^6 cells per well in EMEM medium
containing 10% FCS until they are approx. 60% confluent. Before treatment with LXR compounds, the growth medium is
changed to medium containing 10% charcoal/dextran-stripped FCS for 12 h. Treatment is done for 24 h in medium
containing 10% charcoal/dextran-stripped FCS. Cells are washed twice with ice-cold PBS/0.2% BSA and twice with
cold PBS, and all liquid is carefully removed. Triglycerides are extracted with 1.5 ml hexane/isopropanol (3:2) per
well with gentle shaking for 2-3 h at RT according to Pan et al. (2002) JBC 277, 4413-4421 and Goti et al. (1998)
Biochem J, 332, 57-65. The extraction solution is collected, dried under vacuum and redissolved in isopropanol/1.5%
Triton. The remaining cellular proteins are lysed with 0.3 N NaOH and protein content is measured with the Biorad Bradford
reagent.

Triglyceride levels are measured as esterified glycerol using a commercial enzymatic colorimetric kit (Sigma 343-25P).
A preliminary assay in which the lipase enzyme is omitted confirms that the contribution of free glycerol is negligible.
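
The lipase-omission control described above amounts to a blank subtraction: glycerol detected without lipase is free glycerol, and the difference to the reading with lipase is esterified glycerol. A minimal sketch of this reasoning follows, assuming glycerol readings already converted to nmol per well; the values, units and function names are hypothetical.

```python
def esterified_glycerol(with_lipase: float, without_lipase: float) -> float:
    """Glycerol released from triglycerides: total glycerol (lipase present)
    minus free glycerol (lipase omitted)."""
    return with_lipase - without_lipase


def free_glycerol_fraction(with_lipase: float, without_lipase: float) -> float:
    """Share of the total signal that is due to free glycerol."""
    return without_lipase / with_lipase


# Hypothetical readings for one well (nmol glycerol) and its protein content (mg)
total_glycerol = 100.0   # lipase included
free_glycerol = 2.0      # lipase omitted
protein_mg = 0.5

tg_glycerol = esterified_glycerol(total_glycerol, free_glycerol)
free_fraction = free_glycerol_fraction(total_glycerol, free_glycerol)
tg_per_mg = tg_glycerol / protein_mg

print(f"esterified glycerol: {tg_glycerol} nmol, "
      f"free fraction: {free_fraction:.1%}, "
      f"per mg protein: {tg_per_mg} nmol/mg")
```

When the free fraction is as small as in this illustration, the uncorrected reading can be used directly, which is the simplification the text relies on.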


FIGURE CAPTIONS 
FIG. 1 

[0098] Fig. 1 shows the synthesis of the compounds according to the invention as also described in Example 2.
FIG. 2 

[0099] Fig. 2 shows the measurement parameters employed by the Wallac VICTOR2V™ Multilabel Counter, which
was used for measuring the EC50 values.

FIG. 3 

[0100] Fig. 3 shows a table with the accession numbers for the key genes.


FIG. 4 

[0101] Fig. 4 shows the internal molecular name used by the applicant (MOLNAME) as well as the corresponding
structures of preferred compounds according to the invention. The figure further shows their respective EC50 values
(EC50 AVG) as established according to Example 1 in multiple experiments (see above), as well as their respective
average efficacy (% activity relative to the T0901317 control agonist).

FIG. 5

[0102] Figure 5 shows various known LXR ligands. The compound T0901317 is used as a reference compound 
here. It is apparent from their structures that the inventors have identified novel compounds which are structurally not 
related to these known ligands. 

FIG. 6 

[0103] Figure 6 shows various genes that have been found to be regulated through binding of an LXR agonist to the 
LXR protein. 

FIG. 7 

[0104] Fig. 7 shows the dose dependence of LN0000007033 with LXR beta LBD containing constructs in mammalian
one hybrid (M1H) type assays. The respective µM concentrations of the compounds are given on the x-axis and the
relative light units (RLU) are depicted on the y-axis.

Fig. 8 A and B 

[0105] Analysis of mRNA content of the indicated genes (LXRalpha, ABCA1, ABCG1, FAS and SREBP-1c) in total
RNA isolated from THP-1 cells (8A: LXRalpha, ABCA1, ABCG1) or HepG2 cells (8B: FAS and SREBP-1c) treated for
12 or 24 hours with 10 µM of T0901317 or LN0000007033 (indicated on the x-axis). The relative fold induction is depicted
on the y-axis; the value marked with * for ABCG1 has to be multiplied by a factor of 10.

Fig. 9 

[0106] Analysis of relative fold increase in total cholesterol from supernatants of cultivated THP-1 cells (indicated on
the y-axis) incubated with ApoA1 and with or without 10 µM of the compounds T0901317, LN0000007033 and
LN0000007025 as indicated on the x-axis of Fig. 9A.

Analysis of relative levels of total triglyceride (TG) content in HepG2 cells (indicated on the y-axis) treated with 25 µM
of the indicated compounds T0901317, LN0000007033 and LN0000007025 (indicated on the x-axis).
<212> DNA
















<213> Homo sapiens 












<400> 3
gtggcatcct tccctttcta atcagagatt ttcttcctca gagattttgg cctagatttg 60
caaaatgatg accacatctt tgatttgggg gattgctata gcagcatgct gttgtctatg 120
gcttattctt ggaattagga gaaggcaaac gggtgaacca cctctagaga atggattaat 180
tccatacctg ggctgtgctc tgcaatttgg tgccaatcct cttgagttcc tcagagcaaa 240
tcaaaggaaa catggtcatg tttttacctg caaactaatg ggaaaatatg tccatttcat 300
cacaaatccc ttgtcatacc ataaggtgtt gtgccacgga aaatattttg attggaaaaa 360
atttcacttt gctacttctg cgaaggcatt tgggcacaga agcattgacc cgatggatgg 420
aaataccact gaaaacataa acgacacttt catcaaaacc ctgcagggcc atgccttgaa 480
ttccctcacg gaaagcatga tggaaaacct ccaacgtatc atgagacctc cagtctcctc 540
taactcaaag accgctgcct gggtgacaga agggatgtat tctttctgct accgagtgat 600
gtttgaagct gggtatttaa ctatctttgg cagagatctt acaaggcggg acacacagaa 660
agcacatatt ctaaacaatc ttgacaactt caagcaattc gacaaagtct ttccagccct 720
ggtagcaggc ctccccattc acatgttcag gactgcgcac aatgcccggg agaaactggc 780
agagagcttg aggcacgaga acctccaaaa gagggaaagc atctcagaac tgatcagcct 840
gcgcatgttt ctcaatgaca ctttgtccac ctttgatgat ctggagaagg ccaagacaca 900
cctcgtggtc ctctgggcat cgcaagcaaa caccattcca gcgactttct ggagtttatt 960
tcaaatgatt aggaacccag aagcaatgaa agcagctact gaagaagtga aaagaacatt 1020
agagaatgct ggtcaaaaag tcagcttgga aggcaatcct atttgtttga gtcaagcaga 1080
actgaatgac ctgccagtat taaatagtat aatcaaggaa tcgctgaggc tttccagtgc 1140
ctccctcaac atccggacag ctaaggagga tttcactttg caccttgagg acggttccta 1200
caacatccga aaagatagca tcatagctct ttacccacag ttaatgcact tagatccaga 1260
aatctaccca gaccctttga cttttaaata tgataggtat cttgatgaaa acgggaagac 1320
aaagactacc ttctattgta atggactcaa gttaaagtat tactacatgc cctttggatc 1380
gggagctaca atatgtcctg gaagattgtt cgctatccac gaaatcaagc aatttttgat 1440
tctgatgctt tcttattttg aattggagct tatagagggc caagctaaat gtccaccttt 1500
ggaccagtcc cgggcaggct tgggcatttt gccgccattg aatgatattg aatttaaata 1560
taaattcaag catttgtgaa tacatggctg gaataagagg acactagatg atattacagg 1620
actgcagaac accctcacca cacagtccct ttggacaaat gcatttagtg gtggtagaaa 1680
tgattcacca ggtccaatgt tgttcaccag tgcttgcttg tgaatcttaa cattttggtg 1740
acagtttcca gatgctatca cagactctgc tagtgaaaag aactagtttc taggagcaca 1800
ataatttgtt ttcatttgta taagtccatg aatgttcata tagccaggga ttgaagttta 1860
ttattttcaa aggaaaacac ctttatttta ttttttttca aaatgaagat acacattaca 1920
gccaggtgtg gtagcaggca cctgtagtct tagctactcg agaggccaaa gaaggaggat 1980
ggcttgagcc caggagttca agaccagcct ggacagctta gtgagatccc gtctccgaag 2040
aaaagatatg tattctaatt ggcagattgt tttttcctaa ggaaactgct ttatttttat 2100
aaaactgcct gacaattatg aaaaaatgtt caaattcacg ttctagtgaa actgcattat 2160
ttgttgacta gatggtgggg ttcttcgggt gtgatcatat atcataaagg atatttcaaa 2220
tgattatgat tagttatgtc ttttaataaa aaggaaatat ttttcaactt cttctatatc 2280
caaaattcag ggctttaaac atgattatct tgatttccca aaaacactaa aggtggtttt 2340
attttccctt catgttttaa cttattgttg ctgaaaactc tatgtccggc tttaactatc 2400
ttctctatat ttttatttca ttcacattaa tgagaagagt tttctcagag attaaaaaag 2460
gtagtttttc tgtcattgtt aaatacacat tatcactgaa aaaatgtagc ttttatgatg 2520
tatgttttaa agttaaaact ggatggaaat agccatttgg aagctttggt tatgaaacat 2580
gtggagtgta ttaagtgcag cttgacatta tgttttattt aaatgctttt tatcgctaaa 2640
tgacttgcag atgaaaaaaa ctaaggtgac tcgagtgttt aaatgcctgt gtacaacaat 2700
gctttgataa aatattttaa ggtatgagtt atcagctcta tgtcaattga tatttctgtg 2760
tagtatttat atttaaatta tatttacctt tttgcttatt ttacaaatat taagaaaata 2820
ttctaacatt tgataatttt gaaatgattc atctttcaga aataaaagta tgaatct 2877
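
In a listing laid out this way, each row ends with the running count of nucleotides up to and including that row, so a row can be checked by adding the lengths of its blocks to the previous count. The sketch below applies this check to the last two rows of the sequence above; the function name and the starting count of 2760 (the count of the preceding row) are assumptions made for the illustration.

```python
def check_running_counts(rows: list, start: int = 0) -> list:
    """For each row 'block block ... count', compare the printed running
    count with the number of nucleotides accumulated so far.
    Returns (printed_count, computed_count, matches) per row."""
    results = []
    count = start
    for row in rows:
        *blocks, label = row.split()
        count += sum(len(block) for block in blocks)
        results.append((int(label), count, int(label) == count))
    return results


# Last two rows of the sequence given under <400> 3
rows = [
    "tagtatttat atttaaatta tatttacctt tttgcttatt ttacaaatat taagaaaata 2820",
    "ttctaacatt tgataatttt gaaatgattc atctttcaga aataaaagta tgaatct 2877",
]

results = check_running_counts(rows, start=2760)
for printed, computed, ok in results:
    print(printed, computed, "ok" if ok else "MISMATCH")
```

A mismatch in such a check points to a dropped or duplicated block, which is a common kind of damage when a listing is extracted from a scanned page.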

<210> 4 

<211> 8461 

<212> DNA 

<213> Homo sapiens 



<400> 4 



15 


gagggagcca 


gagagacggc 


agcggccccg 


gcctccctct 


ccgccgcgc l 


xcagcc lccc 


AO 
OU 




gctccgccgc 


gc tccagcct 


cgctctccgc 


cgcccgcacc 


gccgcccgcg 


ccc icaccag 






agcagccatg 


gaggaggtgg 


tgattgccgg 


catgtccggg 


aagctgccag 


ag rcggagaa 


loU 


20 


cttgcaggag 


ttctgggaca 


acctcatcgg 


cggtgtggac 


atgg tcacgg 


acga igaccg 






tcgctggaag 


gcggggctct 


acggcctgcc 


ccggcggtcc 


ggcaagctga 


aggaccrg re 






taggtttgat 


gcctccrtct 


tcggagtcca 


ccccaagcag 


gcacacacga 


uggaccc tea 


3 AH 


25 


gctgcggctg 


ctgctggaag 


tcacctatga agccatcgtg 


gacggaggca 


icaacccaga 






ttcactccga 


ggaacacaca 


ctggcgtctg 


ggtgggcgtg 


agcggcxcxg 


agacc xcgga 


*foU 




ggccctgagc 


cgagaccccg 


agacactcgt 


gggctacagc 


atggtgggct 


gccagcgagc 




30 


gatgatggcc 


aaccggctct 


ccttcttctt 


cgacttcaga 


gggcccagca 


rcgcacrgga 


DUU 




cacagcctgc 


tcctccagcc 


tgatggccct 


gcagaacgcc 


raccaggcca 


uccacagcgg 


DDU 




gcagtgccct 


gccgccatcg 


tggggggcat 


caatgtcctg 


c xgaagccca 


acacc Lccgr 




35 


gcagttcttg 


aggctgggga 


tgctcagccc 


cgagggcacc 


rgcaaggccr 


xcgacacagc 


/ oU 




ggggaatggg 


tactgccgct 


cggagggtgt 


ggtggccgtc 


c rgc xgacca 


agaag lccc l 


RAH 




ggcccggcgg 


gtgtacgcca 


ccatcctgaa 


cgccggcacc 


aatacagatg 


gcttcaagga 


900 


40 


gcaaggcgtg 


accttcccct 


caggggatat 


ccaggagcag 


ctcatccgct 


cgttgtacca 


960 




gtcggccgga 


gtggcccctg 


agtcatttga atacatcgaa 


gcccacggca 


caggcaccaa 


1020 




ggtgggcgac 


ccccaggagc 


tgaatggcat 


cacccgagcc 


ctgtgcgcca 


cccgccagga 


1080 


45 


gccgctgctc 


atcggctcca 


ccaagtccaa 


catggggcac 


ccggagccag 


cctcggggct 


1140 




ggcagccctg 


gccaaggtgc 


tgctgtccct 


ggagcacggg 


ctctgggccc 


ccaacctgca 


1200 




cttccatagc 


cccaaccctg 


agatcccagc 


gctgttggat 


gggcggctgc 


aggtggtgga 


1260 


50 


ccagcccctg 


cccgtccgtg 


gcggcaacgt 


gggcatcaac 


tcctttggct 


tcgggggctc 


1320 




caacgtgcac 


atcatcctga 


ggcccaacac 


gcagccgccc 


cccgcacccg 


ccccacatgc 


1380 




caccctgccc 


cgtctgctgc 


gggccagcgg 


acgcacccct 


gaggccgtgc 


agaagctgct 


1440 


55 


ggagcagggc 


ctccggcaca 


gccaggacct 


ggctttcctg 


agcatgctga 


acgacatcgc 


1500 




ggctgtcccc 


gccaccgcca 


tgcccttccg 


tggctacgct 


gtgctgggtg 


gtgagcgcgg 


1560 



tggcccagag


gtgcagcagg 


tgcccgctgg 


cgagcgcccg 


crc rggr rca 


re rgc Lcrgg 


IDZU 


gatgggcaca 


cagtggcgcg 


ggatggggct 


gagcctcatg 


cgcctggacc 


gcttxcgaga 


i con 


ttccatccta 


cgctccgatg 


aggctgtgaa 


gccattcggc 


ctgaaggtgt 


cacagctgct 


1 "7/1 ft 


gctgagcaca 


gacgagagca 


cctttgatga 


catcgtccat 


"tcgtttgtga 


gcctgactgc 


i onn 


catccagata 


ggcctcatag 


acctgctgag 


ctgcatgggg 


ctgaggccag 


atggcatcgt 


looU 


cggccactcc 


ctgggggagg 


tggcctgtgg 


c tacgccgac 


ggctgcctg l 


cccaggagga 


xyzu 


ggccgtcctc 


gctgcctact 


ggaggggaca 


gtgcatcaaa 


gaagcccatc 


tcccgccggg 


1 OCA 


cgccatggca 


gccgtgggct 


tgtcctggga 


ggagtgtaaa 


cagcgctgcc 


ccccggcggt 


ZU4U 


ggtgcccgcc 


tgccacaact 


ccaaggacac 


agtcaccatc 


tcgggacctc 


aggccccggt 


*} -I Art 


gtttgagttc 


gtggagcagc 


tgaggaagga 


gggtgtgttt 


gccaaggagg 


tgcggaccgg 


ZlbU 


cggtatggcc 


t tccactcct 


acttcatgga 


ggccatcgca 


cccccactgc 


tgcaggagct 


zzzu 


caagaaggtg 


atccgggagc 


cgaagccacg 


ttcagcccgc 


tggctcagca 


cctctatccc 


ZzoU 


cgaggcccag 


tggcacagca 


gcctggcacg 


cacgtcctcc 


gccgagtaca 


atgtcaacaa 


*> 3 /in. 
z540 


cctggtgagc 


cctgtgctgt 


tccaggaggc 


cctgtggcac 


gtgcctgagc 


acgcggtggt 


2400 


gctggagatc 


gcgccccacg 


ccctgctgca 


ggctgtcctg 


aagcgtggcc 


tgaagccgag 


2460 


ctgcaccatc 


atccccctga 


tgaagaagga 


tcacagggac 


aacctggagt 


tctxcctggc 


2dZ0 


cggcatccgg 


aggctgcacc 


tctcaggcat 


cgacgccaac 


cccaatgcct 


tgttcccacc 


ZboO 


tgtggagttc 


ccagctcccc 


gaggaactcc 


cctcatctcc 


ccactcatca 


agtgggacca 


Zb40 


cagcctggcc 


tgggacgtgc 


cggccgccga 


ggacttcccc 


aacggttcag 


gttccccctc 


2700 


agccgccatc 


tacaacatcg 


acaccagctc 


cgagtctcct 


gaccactacc 


tggtggacca 


2/ bO 


caccctcgac 


ggtcgcgtcc 


tcttccccgc 


cactggctac 


ctgagcatag 


tgtggaagac 


2o20 


gctggcccga 


cccctgggcc 


tgggcgtcga 


gcagctgcct 


~ -4- *~m -4— -4- -4- -4- jp-. 

gtggtgtttg 


aggatgtggt 


noon 

2oo0 


gctgcaccag 


gccaccatcc 


tgcccaagac 


tgggacagtg 


tccctggagg 


tacggctcct 


2940 


ggaggcctcc 


cgtgccttcg 


aggtgtcaga 


gaacggcaac 


ctggtagtga 


gtgggaaggt 


3000 


gtaccagtgg 


gatgaccctg 


accccaggct 


cttcgaccac 


ccggaaagcc 


ccacccccaa 


30b0 


ccccacggag 


cccctcttcc 


tggcccaggc 


tgaag t t rac 


aaggagctgc 


gtctgcgtgg 


3120 


ctacgactac 


ggccctcatt 


tccagggcat 


cctggaggcc 


agcctggaag 


gtgactcggg 


3lo0 


gaggctgctg 


tggaaggata 


actgggtgag 


cttcatggac 


accatgctgc 


agatgtccat 


3240 


cctgggctcg 


gccaagcacg 


gcctgtacct 


gcccacccgt 


gtcaccgcca 


tccacatcga 


3300 


ccctgccacc 


cacaggcaga 


agctgtacac 


actgcaggac 


aaggcccaag 


tggctgacgt 


33b0 


ggtggtgagc 


aggtggctga 


gggtcacagt 


ggccggaggc 


gtccacatct 


ccgggctcca 


3420 


cactgagtcg 


gccccgcggc 


ggcagcagga 


gcagcaggtg 


cccatcctgg 


agaagttttg 


3480 


cttcactccc 


cacacggagg 


aggggtgcct 


gtctgagcgc 


gctgccctgc 


aggaggagct 


3540 


gcaactgtgc 


aaggggctgg 


tgcaggcact 


gcagaccaag 


gtgacccagc 


aggggctgaa 


3600 
gatggtggtg


cccggactgg 


atggggccca 


gatcccccgg 


gacccctcac 


agcaggaact 


3660 




gccccggctg 


ttgtcggctg 


cctgcaggct 


tcagctcaac 


gggaacctgc 


agctggagct 


3720 


5 


ggcgcaggtg 


ctggcccagg 


agaggcccaa 


gctgccagag 


gaccctctgc 


tcagcggcct 


3780 




cctggactcc 


ccggcactca 


aggcctgcct 


ggacactgcc 


gtggagaaca 


tgcccagcct 


3840 




gaagatgaag 


gtggtggagg 


tgctggccgg 


ccacggtcac 


ctgtattccc 


gcatcccagg 


3900 


10 


cctgctcagc 


ccccatcccc 


tgctgcagct 


gagctacacg 


gccaccgacc 


gccaccccca 


3960 




ggccctggag 


gctgcccagg 


ccgagctgca 


gcagcacgac 


gttgcccagg 


gccagtggga 


4020 




tcccgcagac 


cctgccccca 


gcgccctggg 


cagcgccgac 


ctcctggtgt 


gcaactgtgc 


4080 


15 


tgtggctgcc 


ctcggggacc 


cggcctcagc 


tctcagcaac 


atggtggctg 


ccctgagaga 


4140 




agggggcttt 


ctgctcctgc 


acacactgct 


ccgggggcac 


ccctcgggac 


atgtggcctt 


4200 




cctcacctcc 


actgagccgc 


agtatggcca 


gggcatcctg 


agccaggacg 


cgtgggagag 


4260 


20 


cctcttctcc 


agggtgtccg 


tgcgcctggt 


gggcctgaag 


aagtccttct 


acggctccac 


4320 




gctcttcctg 


tgccgccggc 


ccaccccgca 


ggacagcccc 


atcttcctgc 


cggtggacga 


4380 




taccagcttc 


cgctgggtgg 


agtctctgaa gggcatcctg 


gctgacgaag 


actcttcccg 


4440 


25 


gcctgtgtgg 


ctgaaggcca 


tcaactgtgc 


cacctcgggc 


gtggtgggct 


tggtgaactg 


4500 




tctccgccga 


gagcccggcg 


gaacgctccg 


gtgtgtgctg 


ctctccaacc 


tcagcagcac 


4560 




ctcccacgtc 


ccggaggtgg 


acccgggctc 


cgcagaactg 


cagaaggtgt 


tgcagggaga 


4620 


30 


cctggtgatg 


aacgtctacc 


gcgacggggc 


ctggggggct 


ttccgccact 


tcctgctgga 


4680 




ggaggacaag 


cctgaggagc 


cgacggcaca 


tgcctttgtg 


agcaccctca 


cccgggggga 


4740 




cctgtcctcc 


atccgctggg 


tctgctcctc 


gctgcgccat 


gcccagccca 


cctgccctgg 


4800 


35 


cgcccagctc 


tgcacggtct 


actacgcctc 


cctcaacttc 


cgcgacatca 


tgctggccac 


4860 




tggcaagctg 


tcccctgatg 


ccatcccagg 


gaagtggacc 


tcccaggaca 


gcctgctagg 


4920 




tatggagttc 


tcgggccgag 


acgccagcgg 


caagcgtgtg 


atgggactgg 


tgcctgccaa 


4980 


40 


gggcctggcc 


acctctgtcc 


tgctgtcacc 


ggacttcctc 


tgggatgtgc 


cttccaactg 


5040 




gacgctggag 


gaggcggcct 


cggtgcctgt 


cgtctacagc 


acggcctact 


acgcgctggt 


5100 




ggtgcgtggg 


cgggtgcgcc 


ccggggagac 


gctgctcatc 


cactcgggct 


cgggcggcgt 


5160 


45 


gggccaggcc 


gccatcgcca 


tcgccctcag 


tctgggctgc 


cgcgtcttca 


ccaccgtggg 


5220 




gtcggctgag 


aagcgggcgt 


acctccaggc 


caggttcccc 


cagctcgaca 


gcaccagctt 


5280 




cgccaactcc 


cgggacacat 


ccttcgagca gcatgtgctg 


tggcacacgg 


gcgggaaggg 


5340 


50 


cgttgacctg 


gtcttgaact 


ccttggcgga 


agagaagctg 


caggccagcg 


tgaggtgctt 


5400 




ggctacgcac 


ggtcgcttcc 


tggaaattgg 


caaattcgac 


ctttctcaga 


accacccgct 


5460 




cggcatggct 


atcttcctga 


agaacgtgac 


attccacggg 


gtcctactgg 


atgcgttctt 


5520 


55 


caacgagagc 


agtgctgact 


ggcgggaggt 


gtgggcgctt 


gtgcaggccg 


gcatccggga 


5580 




tggggtggta 


cggcccctca 


agtgcacggt 


gttccatggg 


gcccaggtgg 


aggacgcctt 


5640 
ccgctacatg


gcccaaggga 


agcacattgg 


caaagtcgtc 


gtgcaggtgc 


l tgcggagga 


C7nn 

3/UU 




gccggaggca 


gtgctgaagg 


gggccaaacc 


caagctgatg 


tcggccatct 


ccaagacctt 


j f DU 


5 


ctgcccggcc 


cacaagagct 


acatcatcgc 


tggtggtctg 


ggtggcttcg 


gcctggagtt 


coin 




ggcgcagtgg 


ctgatacagc 


gtggggtgca 


gaagctcgtg 


ttgacttcTc 


gctccgggat 


coon 
jooU 




ccggacaggc 


taccaggcca 


agcaggtccg 


ccggtggagg 


cgccagggcg 


tacaggtgca 




10 


ggtgtccacc 


agcaacatca 


gctcactgga 


gggggcccgg 


ggcctcattg 


ccgaggcggc 


duuu 




gcagcttggg 


cccgtgggcg 


gcgtcttcaa 


cctggccgtg 


g tct tgagag 


a xggc r xgc l 


OUOU 




ggagaaccag 


accccagagt 


tcttccagga 


cgtctgcaag 


cccaagtaca 


gcggcacccr 


DiZU 


15 


gaacctggac 


agggtgaccc 


gagaggcgtg 


ccctgagctg 


gactactttg 


tggtcttctc 


CI OA 




ctctgtgagc 


tgcgggcgtg 


gcaatgcggg 


acagagcaac 


tacggctttg 


ccaattccgc 


bZ4U 




catggagcgt 


atctgtgaga 


aacgccggca 


cgaaggcctc 


ccaggcctgg 


ccgtgcagtg 


C5AA 

D3UU 


20 


gggcgccatc 


ggcgacgtgg 


gcattttggt 


ggagacgatg 


agcaccaacg 


acacgatcgt 


bibu 




cagtggcacg 


ctgccccagc 


gcatggcgtc 


ctgcctggag 


gtgctggacc 


tcttcctgaa 


D4ZU 




ccagccccac 


atggtcctga 


gcagctttgt 


gctggctgag 


aaggctgcgg 


cctataggga 


c>i on 
D4oU 


25 


cagggacagc 


cagcgggacc 


tggtggaggc 


cgtggcacac 


atcctgggca 


tccgcgactt 


rr/|A 

co40 




ggctgctgtc 


aacctggaca 


gctcactggc 


ggacctgggc 


ctggactcgc 


tcatgagcgt 


bbUU 




ggaggtgcgc 


cagacgctgg 


agcgtgagct 


caacctggtg 


ctgtccgtgc 


gcgaggtgcg 


bbbU 


30 


gcaactcacg 


ctccggaaac 


tgcaggagct 


gtcctcaaag 


gcggatgagg 


ccagcgagct 


b/ ZU 




ggcatgcccc 


acgcccaagg 


aggatggtct 


ggcccagcag 


cagactcagc 


tgaacctgcg 


b/oU 




ctccctgctg 


gtgaacccgg 


agggccccac 


cctgatgcgg 


ctcaactccg 


tgcagagctc 


CO /I A 

bo40 


35 


ggagcggccc 


ctgttcctgg 


tgcacccaat 


cgagggctcc 


accaccgtgt 


tccacagcct 


conn 

byuu 




ggcctcccgg 


ctcagcatcc 


ccacctatgg 


cctgcagtgc 


acccgagctg 


cgccccttga 


bybU 




cagcatccac 


agcctggctg 


cctactacat 


cgactgcatc 


aggcaggtgc 


agcccgaggg 


/uzu 


40 


cccctaccgc 


gtggccggct 


actcctacgg 


ggcctgcgtg 


gcctttgaaa 


tgtgctccca 


7AQA 




gctgcaggcc 


cagcagagcc 


cagcccccac 


ccacaacagc 


ctcttcctgt 


tcgacggctc 


/ 14U 




gcccacctac 


gtactggcct 


acacccagag 


ctaccgggca 


aagctgaccc 


caggctgtga 


/ZUU 


45 


ggctgaggct 


gagacggagg 


ccatatgctt 


cttcgtgcag 


cagttcacgg 


acatggagca 


7")CA 

/ Zbu 




caacagggtg 


ctggaggcgc 


tgctgccgct 


gaagggccta 


gaggagcgtg 


tggcagccgc 






cgtggacctg 


atcatcaaga 


gccaccaggg 


cctggaccgc 


caggagctga 


gctttgcggc 


73QA 


50 


ccggtccttc 


tactacaagc 


tgcgtgccgc 


tgagcagtac 


acacccaagg 


ccaagtacca 


7v1 /I A 
/44U 




tggcaacgtg 


atgctactgc 


gcgccaagac 


gggtggcgcc 


tacggcgagg 


acctgggcgc 


7500 




ggactacaac 


ctctcccagg 


tatgcgacgg 


gaaagtatcc 


gtccacgtca 


tcgagggtga 


7560 


55 


ccaccgcacg 


ctgctggagg 


gcagcggcct 


ggagtccatc 


atcagcatca 


tccacagctc 


7620 


cctggctgag 


ccacgcgtga 


gcgtgcggga 


gggctaggcc 


cgtgcccccg 


cctgccaccg 


7680 
gaggtcactc


caccatcccc 


accccacccc 


accccacccc 


cgccatigcaa 


cgggaLiigaa 


/ / 4U 




gggtcctgcc 


ggtgggaccc 


tgtccggccc 


agtgccac tg 


ccccccgagg 


c xgcragacg 


/ ouu 


5 


taggtgttag 


gcatgtccca 


cccacccgcc 


gcctcccacg 


gcacctcggg 


gacaccagag 


70CA 

/ oDU 




ctgccgactt 


ggagactcct 


ggtctgtgaa 


gagccggtgg 


tgcccgtgcc 


cgcaggaact 


7QTA 

/ y^u 




gggctgggcc 


tcgtgcgccc 


gtggggtctg 


cgcttggtct 


ttctgrgct l 


ggauTtgcaL 


/you 


10 


atttattgca 


ttgctggtag 


agacccccag 


gcctgtccac 


cc ngccaaga 


c lcc xcaggc 


oU4U 




agcgtgtggg 


tcccgcactc 


tgcccccatt 


tccccgatgt 


cccctgcggg 


cgcgggcagc 


oXUU 




cacccaagcc 


tgctggctgc 


ggccccctct 


cggccaggca 


ttggctcagc 


ccgctgagtg 


o-LoU 


15 


gggggtcgtg 


ggccagtccc 


cgaggagctg 


ggcccctgca 


caggcacaca 


gggcccggcc 






acacccagcg 


gccccccgca 


cagccacccg 


tggggtgctg 


cccttatgcc 


cggcgccggg 


8280 




rarraactcc 




tttgtctgtg 


tttgtttttc 


aagaaatgat 


tcaaattgct 


8340 


20 


gcttggattt 


tgaaatttac 


tgtaactgtc 


agtgtacacg 


tctggacccc 


gtttcatttt 


8400 




tacaccaatt 


tggtaaaaat 


gctgctctca gcctcccaca 


attaaaccgc 


atgtgatctc 


8460 




c 












8461 


25 


<400> 5 
acggtcaccc 


gttgccagct 


ctagccttta 


aattcccggc 


tcggggacct 


ccacgcaccg 


oO 


cggctagcgc 


cgacaaccag 


ctagcgtgca 


aggcgccgcg 


gctcagcgcg 


taccggcggg 






cttcgaaacc 


gcagtcctcc 


ggcgaccccg 


aactccgctc 


cggagcctca 


gccccctgga 


JLoU 


40 


aagtgatccc 


ggcatccgag 


agccaagatg 


ccggcccact 


tgctgcagga 


cgatatctct 




agctcctata 


ccaccaccac 


caccattaca gcgcctccct 


ccagggtcct 


gcagaatgga 


jnn 




ggagataagt 


tggagacgat 


gcccctctac 


ttggaagacg 


acattcgccc 


tgatataaaa 




45 


gatgatatat 


atgaccccac 


ctacaaggat 


aaggaaggcc 


caagccccaa 


ggttgaatat 




gtctggagaa 


acatcatcct 


tatgtctctg 


ctacacttgg 


gagccctgta 


tgggatcact 


>i on 




ttgattccta 


cctgcaagtt 


ctacacctgg 


ctttgggggg 


tattctacta 


ttttgtcagt 




50 


gccctgggca 


taacagcagg 


agctcatcgt 


ctgtggagcc 


accgctctta 


caaagctcgg 


enn 


ctgcccctac 


ggctctttct 


gatcattgcc 


aacacaatgg 


cattccagaa 


tgatgtctat 


660 




gaatgggctc 


gtgaccaccg 


tgcccaccac 


aagttttcag 


aaacacatgc 


tgatcctcat 


720 




aattcccgac 


gtggcttttt 


cttctctcac gtgggttggc 


tgcttgtgcg 


caaacaccca 


780 


55 


gctgtcaaag 


agaaggggag 


tacgctagac 


ttgtctgacc 


tagaagctga 


gaaactggtg 


840 
atgttccaga


ggaggtacta 


caaacctggc 


ttgctgatga 


tgtgcttcat 


cctgcccacg 


900 




cttgtgccct 


ggtatttctg 


gggtgaaact 


tttcaaaaca 


gtgtgttcgt 


tgccactttc 


960 


5 


ttgcgatatg 


ctgtggtgct 


taatgccacc 


tggctggtga 


acagtgctgc 


ccacctcttc 


i a *> a 
1020 




ggatatcgtc 


cttatgacaa 


gaacattagc 


ccccgggaga 


atatcctggt 


ttcac ttgga 


1080 




gctgtgggtg 


agggcttcca 


caactaccac 


cactcctttc 


cctatgacta 


ctctgccagt 


1 1 yl A 

1140 


10 


gagtaccgct 


ggcacatcaa 


cttcaccaca ttcttcattg 


attgcatggc 


cgccctcggt 


1200 




ctggcctatg 


accggaagaa 


agtctccaag 


gccgccatct 


tggccaggat 


taaaagaacc 


1260 




ggagatggaa 


actacaagag 


tggctgagtt 


tggggtccct 


caggttcctt 


tttcaaaaac 


1320 


15 


cagccaggca 


gaggttttaa 


tgtctgttta ttaactactg 


aataatgcta 


ccaggatgct 


1380 




aaagatgatg 


atgttaaccc 




ciy Uu L V- V_ L u u 


taaaattcaa 


aagtattgaa 


1440 




agcc 












1444 


<400> 6 
taacgaggaa 


cttttcgccg 


acQccaaacc 


y v_ v_ *. \_ i_ y a y y 


ccagggcagg 


acacgaacgc 


60 




gcggagcggc 


ggcggcgact 


y**y*y*-^-yyy 


y^^y^-yy^-yy 


cgctccctag 


gaagggccgt 


i ~>a 
120 




acgaggcggc 


gggcccggcg 


ggcctcccgg 


aggaggcggc 


tgcgccatgg 


acgagccacc 


-1 OA 

180 


35 


cttcagcgag 


gcggctttgg 


agcaggcgct 


gggcgagccg 


tgcgatctgg 


acgcggcgct 


240 




gctgaccgac 


atcgaagaca 


tgcttcagct 


tatcaacaac 


caagacagtg 


acttccctgg 


300 




cctatttgac 


ccaccctatg 


ctgggagtgg 


ggcagggggc 


acagaccctg 


ccagccccga 


OCA 

360 


40 


taccagctcc 


ccaggcagct 


tgtctccacc 


tcctgccaca 


ttgagctcct 


ctcttgaagc 


420 




cttcctgagc 


gggccgcagg 


cagcgccctc 


acccctgtcc 


cctccccagc 


ctgcacccac 


>i on 

480 




tccattgaag 


atgtacccgt 


ccatgcccgc 


tttctcccct 


gggcctggta 


tcaaggaaga 


r <a 

540 


45 


gtcagtgcca 


ctgagcatcc 


tgcagacccc 


caccccacag 


cccctgccag 


gggccctcct 


bOO 


gccacagagc 


ttcccagccc 


cagccccacc 


gcagttcagc 


tccacccctg 


tgttaggcta 


DOO 




ccccagccct 


ccgggaggct 


tctctacagg 


aagccctccc 


gggaacaccc 


agcagccgct 


7"> A 

720 


50 


gcctggcctg 


ccactggctt 


ccccgccagg 


ggtcccgccc 


gtctccttgc 


acacccaggt 


"7 OA 

780 


ccagagtgtg 


gtcccccagc 


agctactgac 


agtcacagct 


gcccccacgg 


cagcccctgt 


840 




aacgaccact 


gtgacctcgc 


agatccagca ggtcccggtc 


ctgctgcagc 


cccacttcat 


900 


55 


caaggcagac 


tcgctgcttc 


tgacagccat 


gaagacagac 


ggagccactg 


tgaaggcggc 


960 


aggtctcagt 


cccctggtct 


ctggcaccac 


tgtgcagaca 


gggcctttgc 


cgaccctggt 


1020 
gagtggcgga


accatcttgg 


caacagtccc 


actggtcgta 


gatgcggaga 


agctgcc xar 


XUoU 


caaccggctc 


gcagctggca 


gcaaggcccc 


ggcctctgcc 


cagagccgtg 


gagagaagcg 


1140


cacagcccac 


aacgccattg 


agaagcgcta 


ccgctcctcc 


atcaatgaca 


aaatcattga


1200


gctcaaggat 


ctggtggtgg 


gcactgaggc 


aaagctgaat 


aaatctgctg 


tcttgcgcaa 


1260


ggccatcgac 


tacattcgct 


ttctgcaaca 


cagcaaccag 


aaactcaagc 


aggagaacct


1320


aagtctgcgc 


actgctgtcc 


acaaaagcaa 


atctctgaag 


gatctggtgt 


cggcctgtgg 


1380


cagtggaggg 


aacacagacg 


tgctcatgga 


gggcgtgaag 


actgaggtgg 


aggacacact 


1440


gaccccaccc 


ccctcggatg 


ctggctcacc 


tttccagagc 


agccccttgt 


cccttggcag 


1500


caggggcagt 


ggcagcggtg 


gcagtggcag 


tgactcggag 


S— y— <4- y-- ^ y— — % y** y— y— 

cctgacagcc 


cagtct rtga 


1560


ggacagcaag 


gcaaagccag 


agcagcggcc 


gtctctgcac 


agccggggca 


tgctggaccg 


1620


ctcccgcctg 


gccctgtgca 


cgctcgtctt 


cctctgcctg 


tcctgcaacc 


ccttggcc lc 


1680


cttgctgggg 


gcccgggggc 


ttcccagccc 


ctcagatacc 


accagcgtct 


accatagccc 


1740


tgggcgcaac 


gtgctgggca 


ccgagagcag 


agatggccct 


ggctgggccc 


agtggctgct 


1800


gcccccagtg 


gtctggctgc 


tcaatgggct 


gttggtgctc 


gtctccttgg 


-t- mm -*- -4- r- -4- -#- 

tgcttctctt 


1860


tgtctacggt 


gagccagtca 


cacggcccca 


ctcaggcccc 


gccgtgtact 


tctggaggca


1920

tcgcaagcag 


gctgacctgg 


acctggcccg 


gggagacttt 


gcccaggctg 


cccagcagct 


1980


gtggctggcc 


ctgcgggcac 


tgggccggcc 


cctgcccacc 


tcccacctgg 


acctggcttg 


2040


tagcctcctc 


tggaacctca 


tccgtcacct 


gctgcagcgt 


ctctgggtgg 


gccgctggct 


2100


ggcaggccgg 


gcagggggcc 


tgcagcagga 


ctgtgctctg


cgagtggatg 


ctagcgccag


2160


cgcccgagac 


gcagccctgg 


tctaccataa 


gctgcaccag 


ctgcacacca 


tggggaagca


2220

cacaggcggg 


cacctcactg 


ccaccaacct 


ggcgctgagt 


gccctgaacc 


tggcagagtg


2280


tgcaggggat 


gccgtgtctg 


tggcgacgct 


ggccgagatc 


tatgtggcgg 


ctgcattgag 


2340


agtgaagacc 


agtctcccac 


gggccttgca 


ttttctgaca 


cgcttcttcc 


tgagcagtgc 


2400


ccgccaggcc 


tgcctggcac 


agagtggctc 


agtgcctcct 


gccatgcagt 


ggctctgcca 


2460


ccccgtgggc 


caccgtttct 


tcgtggatgg 


ggactggtcc 


gtgctcagta 


ccccatggga 


2520


gagcctgtac 


agcttggccg 


ggaacccagt 


ggaccccctg 


gcccaggtga 


ctcagctatt


2580

ccgggaacat 


ctcttagagc 


gagcactgaa 


ctgtgtgacc 


cagcccaacc 


ccagccctgg 


2640


gtcagctgat 


ggggacaagg 


aattctcgga 


tgccctcggg 


tacctgcagc 


tgctgaacag 


2700


ctgttctgat 


gctgcggggg 


ctcctgccta 


cagcttctcc 


atcagttcca 


gcatggccac 


2760


caccaccggc 


gtagacccgg 


tggccaagtg 


gtgggcctct 


ctgacagctg 


tggtgatcca


2820


ctggctgcgg 


cgggatgagg 


aggcggctga 


gcggctgtgc 


ccgctggtgg 


agcacctgcc 


2880 


ccgggtgctg 


caggagtctg 


agagacccct 


gcccagggca 


gctctgcact 


ccttcaaggc 


2940 


tgcccgggcc 


ctgctgggct 


gtgccaaggc 


agagtctggt 


ccagccagcc 


tgaccatctg 


3000 


tgagaaggcc 


agtgggtacc 


tgcaggacag 


cctggctacc 


acaccagcca 


gcagctccat 


3060
tgacaaggcc


gtgcagctgt 


tcctgtgtga 


cctgcttctt 


gtggtgcgca 


ccagcctgtg 


3120




gcggcagcag 


cagcccccgg 


ccccggcccc 


agcagcccag 


ggcaccagca 


gcaggcccca 


3180 




ggcttccgcc 


cttgagctgc 


gtggcttcca 


acgggacctg 


agcagcctga 


ggcggctggc 


3240 




acagagcttc 


cggcccgcca 


tgcggagggt 


gttcctacat 


gaggccacgg 


cccggctgat 


3300 




ggcgggggcc 


agccccacac 


ggacacacca 


gctcctcgac 


cgcagtctga 


ggcggcgggc 


3360 




aggccccggt 


ggcaaaggag 


gcgcggtggc 


ggagctggag 


ccgcggccca 


cgcggcggga 


3420 




gcacgcggag 


gccttgctgc 


tggcctcctg 


ctacctgccc 


cccggcttcc 


tgtcggcgcc 


3480 




cgggcagcgc 


gtgggcatgc 


tggctgaggc 


ggcgcgcaca 


ctcgagaagc 


ttggcgatcg 


3540 




ccggctgctg 


cacgactgtc 


agcagatgct 


catgcgcctg 


ggcggtggga 


ccactgtcac 


3600 




ttccagctag 


accccgtgtc 


cccggcctca 


gcacccctgt 


ctctagccac 


tttggtcccg 


3660 




tgcagcttct 


gtcctgcgtc 


gaagctttga 


aggccgaagg 


cagtgcaaga 


gactctggcc 


3720 




tccacagttc 


gacctgcggc 


tgctgtgtgc 


cttcgcggtg 


gaaggcccga 


ggggcgcgat 


3780




cttgacccta 


agaccggcgg 


ccatgatggt 


gctgacctct 


ggtggccgat 


cggggcactg 


3840 




caggggccga 


gccattttgg 


ggggcccccc 


tccttgctct 


gcaggcacct 


tagtggcttt 


3900 




tttcctcctg 


tgtacaggga 


agagaggggt 


acatttccct 


gtgctgacgg 


aagccaactt 


3960 




ggctttcccg 


gactgcaagc 


agggctctgc 


cccagaggcc 


tctctctccg 


tcgtgggaga 


4020 




gagacgtgta 


catagtgtag 


gtcagcgtgc 


ttagcctcct 


gacctgaggc 


tcctgtgcta 


4080 




ctttgccttt 


tgcaaacttt 


attttcatag 


attgagaagt 


tttgtacaga 


gaattaaaaa 


4140 




tgaaattatt 


tata

<400> 7

gtaattgcga gcgagagtga 


gtggggccgg 


gacccgcaga 


gccgagccga 


cccttctctc 


60 


ccgggctgcg gcagggcagg 


gcggggagct 


ccgcgcacca 


acagagccgg 


ttctcagggc 


120 


gctttgctcc ttgttttttc 


cccggttctg 


ttttctcccc 


ttctccggaa 


ggcttgtcaa 


180 


ggggtaggag aaagagacgc 


aaacacaaaa 


gtggaaaaca 


gttaatgacc 


agccacggcg 


240 


tccctgctgt gagctctggc 


cgctgccttc 


cagggctccc 


gagccacacg 


ctgggggtgc 


300 


tggctgaggg aacatggctt 


gttggcctca 


gctgaggttg 


ctgctgtgga 


agaacctcac 


360 


tttcagaaga agacaaacat 


gtcagctgct 


gctggaagtg 


gcctggcctc 


tatttatctt 


420 


cctgatcctg atctctgttc 


ggctgagcta 


cccaccctat 


gaacaacatg 


aatgccattt 


480 


tccaaataaa gccatgccct 


ctgcaggaac 


acttccttgg 


gttcagggga 


ttatctgtaa 


540
tgccaacaac


ccctgtttcc 


gttacccgac 


tcctggggag 


gctcccggag 


ttgttggaaa 


600 




ctttaacaaa 


tccattgtgg 


ctcgcctgtt 


ctcagatgct 


cggaggcttc 


ttttatacag 


660 




ccagaaagac 


accagcatga 


aggacatgcg 


caaagttctg 


agaacattac 


agcagatcaa 


720 




gaaatccagc 


tcaaacttga 


agcttcaaga 


tttcctggtg 


gacaatgaaa 


ccttctctgg 


780 




gttcctgtat 


cacaacctct 


ctctcccaaa 


gtctactgtg 


gacaagatgc 


tgagggctga 


840 




tgtcattctc 


cacaaggtat 


ttttgcaagg 


ctaccagtta 


catttgacaa 


gtctgtgcaa 


900 




tggatcaaaa 


tcagaagaga 


tgattcaact 


tggtgaccaa 


gaagtttctg 


agctttgtgg 


960 




cctaccaagg 


gagaaactgg 


ctgcagcaga 


gcgagtactt 


cgttccaaca 


tggacatcct 


1020 




gaagccaatc 


ctgagaacac 


taaactctac 


atctcccttc 


ccgagcaagg 


agctggctga 


1080 




agccacaaaa 


acattgctgc 


atagtcttgg 


gactctggcc 


caggagctgt 


tcagcatgag 


1140 




aagctggagt 


gacatgcgac 


aggaggtgat 


gtttctgacc 


aatgtgaaca 


gctccagctc 


1200 




ctccacccaa 


atctaccagg 


ctgtgtctcg 


tattgtctgc 


gggcatcccg 


agggaggggg 


1260 




gctgaagatc 


aagtctctca 


actggtatga 


ggacaacaac 


tacaaagccc 


tctttggagg 


1320 




caatggcact 


gaggaagatg 


ctgaaacctt 


ctatgacaac 


tctacaactc 


cttactgcaa 


1380 




tgatttgatg 


aagaatttgg 


agtctagtcc 


tctttcccgc 


attatctgga 


aagctctgaa 


1440 




gccgctgctc 


gttgggaaga 


tcctgtatac 


acctgacact 


ccagccacaa 


ggcaggtcat 


1500 




ggctgaggtg 


aacaagacct 


tccaggaact 


ggctgtgttc 


catgatctgg 


aaggcatgtg 


1560 




ggaggaactc 


agccccaaga 


tctggacctt 


catggagaac 


agccaagaaa 


tggaccttgt 


1620 




ccggatgctg 


ttggacagca 


gggacaatga 


ccacttttgg 


gaacagcagt 


tggatggctt 


1680 




agattggaca 


gcccaagaca 


tcgtggcgtt 


tttggccaag 


cacccagagg 


atgtccagtc 


1740 




cagtaatggt 


tctgtgtaca 


cctggagaga 


agctttcaac 


gagactaacc 


aggcaatccg 


1800 




gaccatatct 


cgcttcatgg 


agtgtgtcaa 


cctgaacaag 


ctagaaccca 


tagcaacaga 


1860 




agtctggctc 


atcaacaagt 


ccatggagct 


gctggatgag 


aggaagttct 


gggctggtat 


1920 




tgtgttcact 


ggaattactc 


caggcagcat 


tgagctgccc 


catcatgtca 


agtacaagat 


1980 




ccgaatggac 


attgacaatg 


tggagaggac 


aaataaaatc 


aaggatgggt 


actgggaccc 


2040 




tggtcctcga 


gctgacccct 


ttgaggacat 


gcggtacgtc 


tgggggggct 


tcgcctactt 


2100 




gcaggatgtg 


gtggagcagg 


caatcatcag 


ggtgctgacg 


ggcaccgaga 


agaaaactgg 


2160 




tgtctatatg 


caacagatgc 


cctatccctg 


ttacgttgat 


gacatctttc 


tgcgggtgat 


2220 




gagccggtca 


atgcccctct 


tcatgacgct 


ggcctggatt 


tactcagtgg 


ctgtgatcat 


2280 




caagggcatc 


gtgtatgaga 


aggaggcacg 


gctgaaagag 


accatgcgga 


tcatgggcct 


2340 




ggacaacagc 


atcctctggt 


ttagctggtt 


cattagtagc 


ctcattcctc 


ttcttgtgag 


2400 




cgctggcctg 


ctagtggtca 


tcctgaagtt 


aggaaacctg 


ctgccctaca 


gtgatcccag 


2460 




cgtggtgttt 


gtcttcctgt 


ccgtgtttgc 


tgtggtgaca 


atcctgcagt 


gcttcctgat 


2520 




tagcacactc 


ttctccagag 


ccaacctggc 


agcagcctgt 


gggggcatca 


tctacttcac 


2580
gctgtacctg


ccctacgtcc 


tgtgtgtggc 


atggcaggac 


tacgtgggct 


tcacactcaa 


2640 


gatcttcgct 


agcctgctgt 


ctcctgtggc 


ttttgggttt 


ggctgtgagt 


actttgccct 


2700 


ttttgaggag 


cagggcattg 


gagtgcagtg 


ggacaacctg 


tttgagagtc 


ctgtggagga 


2760 


agatggcttc 


aatctcacca 


cttcggtctc 


catgatgctg 


tttgacacct 


tcctctatgg 


2820 


ggtgatgacc 


tggtacattg 


aggctgtctt 


tccaggccag 


tacggaattc 


ccaggccctg 


2880 


gtattttcct 


tgcaccaagt 


cctactggtt 


tggcgaggaa 


agtgatgaga 


agagccaccc 


2940 


tggttccaac 


cagaagagaa 


tatcagaaat 


ctgcatggag 


gaggaaccca 


cccacttgaa 


3000 


gctgggcgtg 


tccattcaga 


acctggtaaa 


agtctaccga 


gatgggatga 


aggtggctgt 


3060 


cgatggcctg 


gcactgaatt 


tttatgaggg 


ccagatcacc 


tccttcctgg 


gccacaatgg 


3120 


agcggggaag 


acgaccacca 


tgtcaatcct 


gaccgggttg 


ttccccccga 


cctcgggcac 


3180 


cgcctacatc 


ctgggaaaag 


acattcgctc 


tgagatgagc 


accatccggc 


agaacctggg 


3240 


ggtctgtccc 


cagcataacg 


tgctgtttga 


catgctgact 


gtcgaagaac 


acatctggtt 


3300 


ctatgcccgc 


ttgaaagggc 


tctctgagaa 


gcacgtgaag 


gcggagatgg 


agcagatggc 


3360 


cctggatgtt 


ggtttgccat 


caagcaagct 


gaaaagcaaa 


acaagccagc 


tgtcaggtgg 


3420 


aatgcagaga 


aagctatctg 


tggccttggc 


ctttgtcggg 


ggatctaagg 


ttgtcattct 


3480 


ggatgaaccc 


acagctggtg 


tggaccctta 


ctcccgcagg 


ggaatatggg 


agctgctgct 


3540 


gaaataccga 


caaggccgca 


ccattattct 


ctctacacac 


cacatggatg 


aagcggacgt 


3600 


cctgggggac 


aggattgcca 


tcatctccca 


tgggaagctg 


tgctgtgtgg 


gctcctccct 


3660 


gtttctgaag 


aaccagctgg 


gaacaggcta 


ctacctgacc 


ttggtcaaga 


aagatgtgga 


3720 


atcctccctc 


agttcctgca 


gaaacagtag 


tagcactgtg 


tcatacctga 


aaaaggagga 


3780 


cagtgtttct 


cagagcagtt 


ctgatgctgg 


cctgggcagc 


gaccatgaga 


gtgacacgct 


3840 


gaccatcgat 


gtctctgcta 


tctccaacct 


catcaggaag 


catgtgtctg 


aagcccggct 


3900 


ggtggaagac 


atagggcatg 


agctgaccta 


tgtgctgcca 


tatgaagctg 


ctaaggaggg 


3960 


agcctttgtg 


gaactctttc 


atgagattga 


tgaccggctc 


tcagacctgg 


gcatttctag 


4020 


ttatggcatc 


tcagagacga 


ccctggaaga 


aatattcctc 


aaggtggccg 


aagagagtgg 


4080 


ggtggatgct 


gagacctcag 


atggtacctt 


gccagcaaga 


cgaaacaggc 


gggccttcgg 


4140 


ggacaagcag 


agctgtcttc 


gcccgttcac 


tgaagatgat 


gctgctgatc 


caaatgattc 


4200 


tgacatagac 


ccagaatcca 


gagagacaga 


cttgctcagt 


gggatggatg 


gcaaagggtc 


4260 


ctaccaggtg 


aaaggctgga 


aacttacaca 


gcaacagttt 


gtggcccttt 


tgtggaagag 


4320 


actgctaatt 


gccagacgga 


gtcggaaagg 


attttttgct 


cagattgtct 


tgccagctgt 


4380 


gtttgtctgc 


attgcccttg 


tgttcagcct 


gatcgtgcca 


ccctttggca 


agtaccccag 


4440 


cctggaactt 


cagccctgga 


tgtacaacga 


acagtacaca 


tttgtcagca 


atgatgctcc 


4500 


tgaggacacg 


ggaaccctgg 


aactcttaaa 


cgccctcacc 


aaagaccctg 


gcttcgggac 


4560 


ccgctgtatg 


gaaggaaacc 


caatcccaga 


cacgccctgc 


caggcagggg 


aggaagagtg 


4620
gaccactgcc


ccagttcccc 


agaccatcat 


ggacctcttc 


cagaatggga 


actggacaat 


4680


gcagaaccct 


tcacctgcat 


gccagtgtag 


cagcgacaaa 


atcaagaaga 


tgctgcctgt 


4740 


gtgtccccca 


ggggcagggg 


ggctgcctcc 


tccacaaaga 


aaacaaaaca 


ctgcagatat 


4800 


ccttcaggac 


ctgacaggaa 


gaaacatttc 


ggattatctg 


gtgaagacgt 


atgtgcagat 


4860 


catagccaaa 


agcttaaaga 


acaagatctg 


ggtgaatgag 


tttaggtatg 


gcggcttttc 


4920 


cctgggtgtc 


agtaatactc 


aagcacttcc 


tccgagtcaa 


gaagttaatg 


atgccatcaa 


4980


acaaatgaag 


aaacacctaa 


agctggccaa 


ggacagttct 


gcagatcgat 


ttctcaacag 


5040 


cttgggaaga 


tttatgacag 


gactggacac 


caaaaataat 


gtcaaggtgt 


ggttcaataa 


5100 


caagggctgg 


catgcaatca 


gctctttcct 


gaatgtcatc 


aacaatgcca 


ttctccgggc 


5160 


caacctgcaa 


aagggagaga 


accctagcca 


ttatggaatt 


actgctttca 


atcatcccct 


5220 


gaatctcacc 


aagcagcagc 


tctcagaggt 


ggctctgatg 


accacatcag 


tggatgtcct 


5280


tgtgtccatc 


tgtgtcatct 


ttgcaatgtc 


cttcgtccca 


gccagctttg 


tcgtattcct 


5340 


gatccaggag 


cgggtcagca 


aagcaaaaca 


cctgcagttc 


atcagtggag 


tgaagcctgt 


5400 


catctactgg 


ctctctaatt 


ttgtctggga 


tatgtgcaat 


tacgttgtcc 


ctgccacact 


5460 


ggtcattatc 


atcttcatct 


gcttccagca 


gaagtcctat 


gtgtcctcca 


ccaatctgcc 


5520 


tgtgctagcc 


cttctacttt 


tgctgtatgg 


gtggtcaatc 


acacctctca 


tgtacccagc 


5580


ctcctttgtg 


ttcaagatcc 


ccagcacagc 


ctatgtggtg 


ctcaccagcg 


tgaacctctt 


5640 


cattggcatt 


aatggcagcg 


tggccacctt 


tgtgctggag 


ctgttcaccg 


acaataagct 


5700 


gaataatatc 


aatgatatcc 


tgaagtccgt 


gttcttgatc 


ttcccacatt 


tttgcctggg 


5760 


acgagggctc 


atcgacatgg 


tgaaaaacca 


ggcaatggct 


gatgccctgg 


aaaggtttgg 


5820 


ggagaatcgc 


tttgtgtcac 


cattatcttg 


ggacttggtg 


ggacgaaacc 


tcttcgccat 


5880


ggccgtggaa 


ggggtggtgt 


tcttcctcat 


tactgttctg 


atccagtaca 


gattcttcat 


5940 


caggcccaga 


cctgtaaatg 


caaagctatc 


tcctctgaat 


gatgaagatg 


aagatgtgag 


6000 


gcgggaaaga 


cagagaattc 


ttgatggtgg 


aggccagaat 


gacatcttag 


aaatcaagga 


6060 


gttgacgaag 


atatatagaa 


ggaagcggaa 


gcctgctgtt 


gacaggattt 


gcgtgggcat 


6120 


tcctcctggt 


gagtgctttg 


ggctcctggg 


agttaatggg 


gctggaaaat 


catcaacttt 


6180 


caagatgtta 


acaggagata 


ccactgttac 


cagaggagat 


gctttcctta 


acaaaaatag 


6240 


tatcttatca 


aacatccatg 


aagtacatca 


gaacatgggc 


tactgccctc 


agtttgatgc 


6300 


catcacagag 


ctgttgactg 


ggagagaaca 


cgtggagttc 


tttgcccttt 


tgagaggagt 


6360


cccagagaaa 


gaagttggca 


aggttggtga 


gtgggcgatt 


cggaaactgg 


gcctcgtgaa 


6420 


gtatggagaa 


aaatatgctg 


gtaactatag 


tggaggcaac 


aaacgcaagc 


tctctacagc 


6480 


catggctttg 


atcggcgggc 


ctcctgtggt 


gtttctggat 


gaacccacca 


caggcatgga 


6540 


tcccaaagcc 


cggcggttct 


tgtggaattg 


tgccctaagt 


gttgtcaagg 


aggggagatc 


6600 


agtagtgctt 


acatctcata 


gtatggaaga 


atgtgaagct 


ctttgcacta 


ggatggcaat 


6660
catggtcaat


ggaaggttca 


ggtgccttgg 


cagtgtccag 


catctaaaaa 


ataggtttgg


6720


agatggttat 


acaatagttg 


tacgaatagc 


agggtccaac 


ccggacctga 


agcctgtcca 


6780


ggatttcttt 


ggacttgcat 


ttcctggaag 


tgttctaaaa 


gagaaacacc 


ggaacatgct 


6840


acaataccag 


cttccatctt


cattatcttc 


tctggccagg 


atattcagca 


xccLCtccca


6900

gagcaaaaag 


cgactccaca 


tagaagacta 


ctctgtttct 


cagacaacac 


xxgaccaagt


6960

atttgtgaac 


tttgccaagg 


accaaagtga 


tgatgaccac 


ttaaaagacc 


tctcattaca


7020


caaaaaccag 


acagtagtgg 


acgttgcagt 


tctcacatct 


tttctacagg 


atgagaaagt 


7080


gaaagaaagc 


tatgtatgaa 


gaatcctgtt 


catacggggt 


ggctgaaagt 


aaagaggaac 


7140


tagactttcc 


tttgcaccat 


gtgaagtgtt 


gtggagaaaa 


gagccagaag 


ttgatgtggg 


7200


aagaagtaaa 


ctggatactg 


tactgatact 


attcaatgca 


atgcaattca 


atgcaatgaa


7260

aacaaaattc 


cattacaggg 


gcagtgcctt 


tgtagcctat 


gtcttgtatg 


gctctcaagt


7320

gaaagacttg 


aatttagttt 


tttacctata 


cctatgtgaa 


actctattat 


ggaacccaat 


7380


ggacatatgg 


gtttgaactc 


acactttttt 


A ■ j j j ^ ^_ .A_ ^ 

tttttttttt 


gttcctgtgt 


attctcattg 


7440


gggttgcaac 


aataattcat 


caagtaatca 


tggccagcga 


ttattgatca 


aaatcaaaag 


7500


gtaatgcaca 


tcctcattca 


ctaagccatg 


ccatgcccag 


gagactggtt 


tcccggtgac 


7560


acatccattg 


ctggcaatga 


gtgtgccaga 


gttattagtg 


ccaagttttt 


cagaaagttt 


7620


gaagcaccat 


ggtgtgtcat 


gctcactttt 


gtgaaagctg 


ctctgctcag 


agtctatcaa 


7680


cattgaatat 


cagttgacag 


aatggtgcca 


tgcgtggcta 


acatcctgct 


ttgattccct 


7740


ctgataagct 


gttctggtgg 


cagtaacatg 


caacaaaaat 


gtgggtgtct 


ccaggcacgg 


7800


gaaacttggt 


tccattgtta 


tattgtccta 


tgcttcgagc 


catgggtcta 


cagggtcatc 


7860


cttatgagac 


tcttaaatat 


acttagatcc 


tggtaagagg 


caaagaatca 


acagccaaac


7920

tgctggggct 


gcaagctgct 


gaagccaggg 


catgggatta 


aagagattgt 


gcgttcaaac 


7980


ctagggaagc 


ctgtgcccat 


ttgtcctgac 


tgtctgctaa 


catggtacac 


tgcatctcaa 


8040


gatgtttatc 


tgacacaagt 


gtattatttc 


tggctttttg 


aattaatcta 


gaaaatgaaa 


8100


agatggagtt 


gtattttgac 


aaaaatgttt 


gtacttttta 


atgttatttg 


gaattttaag 


8160


ttctatcagt 


gacttctgaa 


tccttagaat 


ggcctctttg 


tagaaccctg 


tggtatagag 


8220


gagtatggcc 


actgccccac 


tatttttatt 


ttcttatgta 


agtttgcata 


tcagtcatga 


8280


ctagtgccta 


gaaagcaatg 


tgatggtcag 


gatctcatga 


cattatattt 


gagtttcttt 


8340


cagatcattt 


aggatactct 


taatctcact 


tcatcaatca 


aatatttttt 


gagtgtatgc 


8400


tgtagctgaa 


agagtatgta 


cgtacgtata 


agactagaga 


gatattaagt 


ctcagtacac 


8460


ttcctgtgcc 


atgttattca 


gctcactggt 


ttacaaatat 


aggttgtctt 


gtggttgtag 


8520 


gagcccactg 


taacaatact 


gggcagcctt 


tttttttttt 


tttttaattg 


caacaatgca 


8580 


aaagccaaga 


aagtataagg 


gtcacaagtc 


taaacaatga 


attcttcaac 


agggaaaaca 


8640 


gctagcttga 


aaacttgctg 


aaaaacacaa 


cttgtgttta 


tggcatttag 


taccttcaaa 


8700
taattggctt


tgcagatatt 


ggatacccca 


ttaaatctga 


cagtctcaaa 


tttttcatct


8760


cttcaatcac 


tagtcaagaa 


aaatataaaa 


acaacaaata 


cttccatatg 


gagcattttt 


8820


cagagttttc 


taacccagtc 


ttatttttct 


agtcagtaaa 


catttgtaaa 


aatactgttt 


8880 


cactaatact 


tactgttaac 


tgtcttgaga 


gaaaagaaaa 


atatgagaga 


actattgttt 


8940 


ggggaagttc 


aagtgatctt 


tcaatatcat 


tactaacttc 


ttccactttt 


tccagaattt 


9000 


gaatattaac 


gctaaaggtg 


taagacttca 


gatttcaaat 


taatctttct 


atatttttta 


9060 


aatttacaga 


atattatata 


acccactgct 


gaaaaagaaa 


aaaatgattg 


ttttagaagt 


9120 


taaagtcaat 


attgatttta 


aatataagta 


atgaaggcat 


atttccaata 


actagtgata 


9180 


tggcatcgtt 


gcattttaca 


gtatcttcaa 


aaatacagaa 


tttatagaat 


aatttctcct 


9240 


catttaatat 


ttttcaaaat 


caaagttatg 


gtttcctcat 


tttactaaaa 


tcgtattcta 


9300 


attcttcatt 


atagtaaatc 


tatgagcaac 


tccttacttc 


ggttcctctg 


atttcaaggc 


9360 


catattttaa 


aaaatcaaaa 


ggcactgtga 


actattttga 


agaaaacaca 


acattttaat 


9420 


acagattgaa 


aggacctctt 


ctgaagctag 


aaacaatcta 


tagttataca 


tcttcattaa 


9480 


tactgtgtta 


ccttttaaaa 


tagtaatttt 


ttacattttc 


ctgtgtaaac 


ctaattgtgg 


9540 


tagaaatttt 


taccaactct 


atactcaatc 


aagcaaaatt 


tctgtatatt 


ccctgtggaa 


9600 


tgtacctatg 


tgagtttcag 


aaattctcaa 


aatacgtgtt 


caaaaatttc 


tgcttttgca 


9660 


tctttgggac 


acctcagaaa 


acttattaac 


aactgtgaat 


atgagaaata 


cagaagaaaa 


9720 


taataagccc 


tctatacata 


aatgcccagc 


acaattcatt 


gttaaaaaac 


aaccaaacct 


9780


cacactactg 


tatttcatta 


tctgtactga 


aagcaaatgc 


tttgtgacta 


ttaaatgttg 


9840 


cacatcattc 


attcactgta 


tagtaatcat 


tgactaaagc 


catttgtctg 


tgttttcttc 


9900 


ttgtggttgt 


atatatcagg 


taaaatattt 


tccaaagagc 


catgtgtcat 


gtaatactga 


9960 


accactttga 


tattgagaca 


ttaatttgta 


cccttgttat 


tatctactag 


taataatgta 


10020 


atactgtaga 


aatattgctc 


taattctttt 


caaaattgtt 


gcatccccct 


tagaatgttt 


10080 


ctatttccat 


aaggatttag 


gtatgctatt 


atcccttctt 


ataccctaag 


atgaagctgt 


10140 


ttttgtgctc 


tttgttcatc 


attggccctc 


attccaagca 


ctttacgctg 


tctgtaatgg 


10200 


gatctatttt 


tgcactggaa 


tatctgagaa ttgcaaaact 


agacaaaagt 


ttcacaacag 


10260 


atttctaagt 


taaatcattt 


tcattaaaag 


gaaaaaagaa 


aaaaaatttt 


gtatgtcaat 


10320 


aactttatat 


gaagtattaa 


aatgcatatt 


tctatgttgt 


aatataatga 


gtcacaaaat 


10380 


aaagctgtga 


cagttctgtt 


ggtctacaga 


aa 






10412

<400> 8
ttctttccaa
aaacctgggc
ctcataagtg
tgcctgccct
gccctctctg
tctcttggcc
ggccgcagag
tcacctgcgt
tgggacactg
cgtgcatgta
agactgcgtg
gccgatgtct
cgccagtgcg
ggtggtcgct
ttgggggttt
cgggaagttc
cacgctgatg
caacggcctg
tgacatgctg
gcttcaggag
cttgctgtct
ggccatcgcg
cggcctggac
gggtcgctcc
ccagctttac
gccatatttg
catggaggtt
ggagggcatg
ttttctttgg
aaaggactcc
ctgcatcctc
gcgcatcacc
gaacgaagcc



gggtctctgg gtgaggcccg 
gtgatatacc tcccttttag 
gtttcccata caagaaagat 
cccgccagga ggtggtcttc 
ctggggctcc gggacatggt 
agactgtggt gtctggtccc 
tatccacacc ctggctgggt 
gcctgggtga tgagaaataa 
acccatgaag agaaagcagt 
caggtggcag ctttagcttc 
tcctgcaaaa tcgaaaatgt 
ggcctacagg gctcaagcat 
gcatcctgca cgtggtcagt 
atcagtggca aagtcttgtc 
cctggcgatc ccatggagga 
aatagtggtg agttggtggc 
aacatcctgg ctggatacag 
ccccgggacc tgcgctgctt 
ctgccgcatc tcactgtgca 
aaggatgaag gcagaaggga 
tgcgccaaca cgcggaccgg 
ctggagctgg tgaacaaccc 
agcgcctcct gcttccaggt 
atcatttgca ccatccacca 
gtcctgagtc aaggacaatg 
agggatttgg gtctgaactg 
gcatccggcg agtacggtga 
tgtgactcag accacaagag 
caccggccct ctgaagaggt 
tcgtccatgg aaggctgcca 
ttcaagagga ccttcctcag 
tcgcacattg ggatcggcct 
aagaaggtct tgagcaactc 



tgaccttccc aagcctctcc 
ggctgctgcg atcatttagg 
gctagcagtg caacagacag 
caacttttgc ccggagtcta 
caggagaggt tggtctgtct 
tactcacacc ttcctgtcag 
gtactggcta cagatggctg 
tgtcacaaca aatatcccat 
tctcacagtt tttacaggca 
tgctaaactg gagagctcag 
ctgtgattca gctcttcagg 
tgtcatcatg cccccatcca 
ccaagtccag ggagggcccc 
agcagctcat ggggcaggaa 
aggatacaag accctcctga 
cattatgggt ccttccgggg 
ggagacgggc atgaaggggg 
ccggaaggtg tcctgctaca 
ggaggccatg atggtgtcgg 
aatggtcaag gagatactga 
gagcctgtca ggtggtcagc 
tccagtcatg ttcttcgatg 
ggtctcgctg atgaaagggc 
gcccagcgcc aaactcttcg 
tgtgtaccgg ggaaaagtct 
cccaacctac cacaacccag 
tcagaacagt cggctggtga 
agacctcggg ggtgatgccg 
aaagcagaca aaacgattaa 
cagcttctct gccagctgcc 
catcatgagg gactcggtcc 
cctcattggc ctgctgtact 
cggcttcctc ttcttctcca 



ctatcttata


60 


cagattaaac 


120 


aacacttacc 


180 


v-«y «*yyy L yy 


240 


o"taccocca"t 


300 


aatatrr aaa 


360 


"taactccaaa 


420 


ctacattctc 


480 


caaccacaac 


540 


1" at" 1" 1" a 1" a a c 


600 


aaaaaaaaaT 


660 




720 


atcacctaaa 


780 




840 


aannaatttr 


900 


v_ \_ y y y a ca y i_v_ 


960 


ccaticct'cat" 

v. \_ y v- v. v. v* v_ u. 


1020 


Lv.a i_ y i_ a. y y ci 


1080 


raratrtnaa 

LuLClLV. L y Cld 


1140 


ranrnrtnnn 

Lay v_y l uyyy 


1200


acaaaracrt 

y v_ uay v_ y v_ v_ \_ 


1260 


aacccaccaa 

ay v. v_ \_ u v. v_ c*y 


1320 


tcactraaaa 

i_ v_ y v_ v- uuy y 


1380 


aortottroa 

a. y v_ u.y n-yo 


1440 


n r a a 1" r 1" 1" n t" 


1500 


caaattttat: 

v.uuu v. t w y v. 


1560 




1620 


oy y uy &CIV-V-V_ 


1680 


aggggttgag 


1740 


tcacgcagtt 


1800 


tgacacacct 


1860 


tggggatcgg 


1920 


tgctgttcct 


1980
catgttcgcg


gccctcatgc 


ctactgttct 


gacatttccc 


ctggagatgg 


gagtctttct 


2040 




tcgggaacac 


ctgaactact 


ggtacagcct 


gaaggcctac 


tacctggcca 


agaccatggc 


2100 




agacgtgccc 


tttcagatca 


tgttcccagt 


ggcctactgc 


agcatcgtgt 


actggatgac 


2160 




gtcgcagccg 


tccgacgccg 


tgcgctttgt 


gctgtttgcc 


gcgctgggca 


ccatgacctc 


2220 




cctggtggca 


cagtccctgg 


gcctgctgat 


cggagccgcc 


tccacgtccc 


tgcaggtggc 


2280 




cactttcgtg 


ggcccagtga 


cagccatccc 


ggtgctcctg 


ttctcggggt 


tcttcgtcag 


2340 




cttcgacacc 


atccccacgt 


acctacagtg 


gatgtcctac 


atctcctatg 


tcaggtatgg 


2400 




gttcgaaggg 


gtcatcctct 


ccatctatgg 


cttagaccgg 


gaagatctgc 


actgtgacat 


2460 




cgacgagacg 


tgccacttcc 


agaagtcgga 


ggccatcctg 


cgggagctgg 


acgtggaaaa 


2520 




tgccaagctg 


tacctggact 


tcatcgtact 


cgggattttc 


ttcatctccc 


tccgcctcat 


2580 




tgcctatttt 


gtcctcaggt 


acaaaatccg 


ggcagagagg 


taaaacacct 


gaatgccagg 


2640 




aaacaggaag 


attagacact 


gtggccgagg 


gcacgtctag 


aatcgaggag 


gcaagcctgt 


2700 




gcccgaccga 


cgacacagag 


actcttctga 


tccaacccct 


agaaccgcgt 


tgggtttgtg 


2760 




ggtgtctcgt 


gctcagccac 


tctgcccagc 


tgggttggat 


cttctctcca 


ttcccctttc 


2820 




tagctttaac 


taggaagatg 


taggcagatt 


ggtggttttt 


ttttttttaa 


catacagaat 


2880 




tttaaatacc 


acaactgggg 


cagaatttaa 


agctgcaaca 


cagctggtga 


tgagaggctt 


2940 




cctcagtcca 


gtcgctcctt 


agcaccaggc 


accgtgggtc 


ctggatgggg 


aactgcaagc 


3000 




agcctctcag 


ctgatggctg 


cacagtcaga 


tgtctggtgg 


cagagagtcc 


gagcatggag 


3060 




cgattccatt 


ttatgactgt 


tgtttttcac 


attttcatct 


ttctaaggtg 


tgtctctttt 


3120 




ccaatgagaa 


gtcatttttg 


caagccaaaa 


gtcgatcaat 


cgcattcatt 


ttaagaaatt 


3180 




ataccttttt 


agtacttgct 


gaagaatgat 


tcagggtaaa 


tcacatactt 


tgtttagaga 


3240 




yy i^yciyyyy l 




v_ ci v_ i_ v_ a. y v_ i_ y 


nt* rl" ra 1" a 


i_ciyciv_ciyi_civ_ 


1" 1" n 1" n a a n n a 

i_ l y i_y aciyyci 






ttgaatgcag 


gttccaggtg 


gagggaagac 


gtggacacca 


tctccactga 


gccatgcaga 


3360 




catttttaaa 


agctatacaa 


aaaattgtga 


gaagacattg 


gccaactctt 


tcaaagtctt 


3420 




tctttttcca 


cgtgcttctt 


attttaagcg 


aaatatattg 


tttgtttctt 


cct 


3473

<210> 9

<211> 2740

<212> DNA

<213> Homo sapiens



<400> 9
aagtcccagt 


cctgctgtcc 


caagggactc 


cggggtcagg 


tggagcaggc 


agggcagtct 


60 




gccacgggct 


ccccaactga 


agccactctg 


gggagggtcc 


ggccaccaga 


aaatttgccc 


120 




agctttgctg 


cctgttggcc 


atgggtgacc 


tctcatcttt 


gacccccgga 


gggtccatgg 


180
gtctccaagt


aaacagaggc 


tcccagagct 


ccctggaggg 


ggctcctgcc 


accgccccgg 


240 




agcctcacag 


cctgggcatc 


ctccatgcct 


cctacagcgt 


cagccaccgc 


gtgaggccct 


300 




ggtgggacat 


cacatcttgc 


cggcagcagt 


ggaccaggca 


gatcctcaaa 


gatgtctcct 


360 




tgtacgtgga 


gagcgggcag 


atcatgtgca 


tcctaggaag 


ctcaggctcc 


gggaaaacca 


420 




cgctgctgga 


cgccatgtcc 


gggaggctgg 


ggcgcgcggg 


gaccttcctg 


ggggaggtgt 


480 




atgtgaacgg 


ccgggcgctg 


cgccgggagc 


agttccagga 


ctgcttctcc 


tacgtcctgc 


540 




agagcgacac 


cctgctgagc 


agcctcaccg 


tgcgcgagac 


gctgcactac 


accgcgctgc 


600 




tggccatccg 


ccgcggcaat 


cccggctcct 


tccagaagaa 


ggtggaggcc 


gtcatggcag 


660 




agctgagtct 


gagccatgtg 


gcagaccgac 


tgattggcaa 


ctacagcttg 


gggggcattt 


720 




ccacgggtga 


gcggcgccgg 


gtctccatcg 


cagcccagct 


gctccaggat 


cctaaggtca 


780 




tgctgtttga 


tgagccaacc 


acaggcctgg 


actgcatgac 


tgctaatcag 


attgtcgtcc 


840 




tcctggtgga 


actggctcgc 


aggaaccgaa 


ttgtggttct 


caccattcac 


cagccccgtt 


900 




ctgagctttt 


tcagctcttt 


gacaaaattg 


ccatcctgag 


cttcggagag 


ctgattttct 


960 




gtggcacgcc 


agcggaaatg 


cttgatttct 


tcaatgactg 


cggttaccct 


tgtcctgaac 


1020 




attcaaaccc 


ttttgacttc 


tatatggacc 


tgacgtcagt 


ggatacccaa 


agcaaggaac 


1080 




gggaaataga 


aacctccaag 


agagtccaga 


tgatagaatc 


tgcctacaag 


aaatcagcaa 


1140 




tttgtcataa 


aactttgaag 


aatattgaaa 


gaatgaaaca 


cctgaaaacg 


ttaccaatgg 


1200 




ttcctttcaa 


aaccaaagat 


tctcctggag 


ttttctctaa 


actgggtgtt 


ctcctgagga 


1260 




gagtgacaag 


aaacttggtg 


agaaataagc 


tggcagtgat 


tacgcgtctc 


cttcagaatc 


1320 




tgatcatggg 


tttgttcctc 


cttttcttcg 


ttctgcgggt 


ccgaagcaat 


gtgctaaagg 


1380 




gtgctatcca 


ggaccgcgta 


ggtctccttt 


accagtttgt 


gggcgccacc 


ccgtacacag 


1440 




gcatgctgaa 


cgctgtgaat 


ctgtttcccg 


tgctgcgagc 


tgtcagcgac 


caggagagtc 


1500 




aggacggcct 


ctaccagaag 


tggcagatga 


tgctggccta 


tgcactgcac 


gtcctcccct 


1560 




tcagcgttgt 


tgccaccatg 


attttcagca 


gtgtgtgcta 


ctggacgctg 


ggcttacatc 


1620 




ctgaggttgc 


ccgatttgga 


tatttttctg 


ctgctctctt 


ggccccccac 


ttaattggtg 


1680 




aatttctaac 


tcttgtgcta 


cttggtatcg 


tccaaaatcc 


aaatatagtc 


aacagtgtag 


1740 




tggctctgct 


gtccattgcg 


ggggtgcttg 


ttggatctgg 


attcctcaga 


aacatacaag 


1800 




aaatgcccat 


tccttttaaa 


atcatcagtt 


attttacatt 


ccaaaaatat 


tgcagtgaga 


1860 




ttcttgtagt 


caatgagttc 


tacggactga 


atttcacttg 


tggcagctca 


aatgtttctg 


1920 




tgacaactaa 


tccaatgtgt 


gccttcactc 


aaggaattca 


attcattgag 


aaaacctgcc 


1980 




caggtgcaac 


atctagattc 


acaatgaact 


ttctgatttt 


gtattcattt 


attccagctc 


2040 




ttgtcatcct 


aggaatagtt 


gttttcaaaa 


taagggatca 


tctcattagc 


aggtagtgaa 


2100 




agccatggct 


gggaaaatgg 


aagtgaagct 


gccgactgtg 


catgactgct 


ctgaacgtct 


2160 




gaaatgagag 


tgccatgtat 


ttctttcttg 


acaggacatc 


tcaagtcttt 


taaccattaa 


2220
gactccattt gtgcctcttg gatccaagca

tcccttgctc ttacaacttg cagggacatg 

ccaagaatgt aaataatatt cataaaccta 

tgttctaggc acagaaaaaa ataggtcagc 

taggcaaaaa taaaatgttt caaggattcc 

gggtttagat gaagcaaggt tatcatgcag 

gacccagagt taaatgcaga tgtccacacg 

gagtgctggg cagatggatg gtgagcacca 

ggattaaaat aaaaaaccgt aggaatgggc 

<210> 10 

<211> 2679 

<212> DNA 

<213> Homo sapiens 



<400> 10 
ctccaggaaa 


cagagtgaag


acactggccc 


gcagcccagg 


gtcacagacc 


tgtgggcccc 


ctgccgaaag 


gggccactcc 


ccaggatacc 


gaaagtgaca 


acagcctgta 


cttcacctac 


gacctcaact 


gccaggtgga 


cctggcctct 


ttcaagatgc 


cctggacatc 


tcccagctgc 


ctaagcttca 


aagtgagaag 


tgggcagatg 


agagcctcct 


tgctagatgt 


gatcactggc 


cagatctgga 


tcaatgggca 


gcccagctcg 


gtgcgccagc 


acaaccagct 


gctccccaac 


gcccagatgc 


ggctgcccag 


aaccttctcc 


gtgatcgcgg 


agctgcggct 


taggcagtgc 


cgggggttgt 


cggggggtga 


gcgcaggaga 


ccaggaatcc 


ttattctcga 


cgaacccacc 


ctggtgaaga 


ccttgtccag 


gctggccaaa 


cagcctcgct 


ctgacatctt 


caggctgttt 


cccatctact 


taggggcggc 


ccagcacatg 


tgtcctcgct 


acagcaatcc 


tgctgacttc 


agcagagagc 


aggaattggc 


caccagggag 



ggccttgaat 


gcaatggaag 


tggtttatag


2280

tggttatttg 


gaaattgtga 


ctgagcggac


2340


tgggagactc 


gtgtgactat 


tttttttcct 


2400


ttaaaaatat 


gtttacattg 


gataaaggat 


2460


tgaccataag 


tgacagagaa 


agagagttgt 


2520


aattgggtaa 


gaatgcttct 


gttcctggaa 


2580 


aggggtcgga 


gttacctgat 


cacatcgaga 


2640 


ctgctacaga 


gcacccagtg 


attttactga 


2700 


tcaacagtga 






2740 



tggcaggcag 


cagctgggtc 


taagagagct 


60 


atggccggga 


aggcggcaga 


ggagagaggg 


120 


tcgggcctcc 


aggatagatt 


gttctcctct 


180 


agtggccagc 


ccaacaccct 


ggaggtcaga 


240 


caggtccctt 


ggtttgagca 


gctggctcag 


300 


cagaattctt 


gtgagctggg 


catccagaac 


360 


ctggccatca 


tagggagctc 


aggttgtggg 


420 


cgaggtcacg 


gcggcaagat 


caagtcaggc 


480 


cctcagctgg 


tgaggaagtg 


tgtggcccac 


540 


ttgactgtgc 


gagagacctt 


ggccttcatt 


600 


caggcccagc 


gtgacaaaag 


ggtggaggac 


660 


gctgacaccc 


gcgtgggcaa 


catgtacgtg 


720 


gtcagcattg 


gggtgcagct 


cctgtggaac 


780 


tctgggctcg 


acagcttcac 


agcccacaac 


840 


ggcaaccggc 


tggtgctcat 


ctccctccac 


900 


gatctggtcc 


tcctgatgac 


gtctggcacc 


960 


gtccagtatt 


tcacagccat 


cggctacccc 


1020 


tatgtggacc 


tgaccagcat 


tgacaggcgc 


1080 


aaggctcagt 


cactcgcagc 


cctgtttcta 


1140 
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gaaaaagtgc 


gtgacttaga 


tgactttcta 


tggaaagcag 


agacgaagga 


xcrrgacgag 


1200




gacacctgtg 


tggaaagcag 


cgtgacccca 


ctagacacca 


actgcctccc 


gagtcctacg 


1260




aagatgcctg 


gggcggtgca 


gcagtttacg 


acgctgatcc 


gtcgtcagat 


ttccaacgac 


1320




ttccgagacc 


tgcccaccct 


cctcatccat 


ggggcggagg 


cctgtctgat 


gxcaa ugacc 


1380




atcggcttcc 


tctattttgg 


ccatgggagc 


atccagctct 


ccttcatgga 


tacagccgcc 


1440




ctcttgttca 


tgatcggtgc 


tctcatccct 


ttcaacgtca 


ttctggatgt 


ca tc rccaaa 


1500




tgttactcag 


agagggcaat 


gctttactat 


gaactggaag 


acgggctgta 


caccactggt 






ccatatttct 


ttgccaagat 


cctcggggag 


cttccggagc 


actgtgccta 


catcatcatc 






tacgggatgc 


ccacctactg 


gctggccaac 


ctgaggccag 


gcctccagcc 


cttcctgctg 


1680




cacttcctgc 


tggtgtggct 


ggtggtcttc 


tgttgcagga ttatggccct 


ggccgccgcg 


1740




gccctgctcc 


ccaccttcca 


catggcctcc 


ttcttcagca atgccctcta 


caac tec Ltc 


1800




tacctcgccg 


ggggcttcat 


gataaacttg 


agcagcctgt 


ggacagtgcc 


cgcgtggatt 


1860




tccaaagtgt 


ccttcctgcg 


gtggtgtttt 


gaagggctga tgaagattca 


gttcagcaga 






agaacttata 


aaatgcctct 


cgggaacctc 


accatcgcgg 


tctcaggaga 


taaaatcctx 


1980




agtgccatgg 


agctggactc 


gtaccctctc 


tacgccatct 


acctcatcgt 


cattggcctc 


2040




agcggtggct 


tcatggtcct 


gtactacgtg 


tccttaaggt 


tcatcaaaca 


gaaaccaagt 


2100




caagactggt 


gattcacgcc 


agacgtctgc 


ccgctggtgg 


gggacctgag 


cagacccttc 


2160




aactgcactc 


cctcctcagg 


agccccttcc 


tggggacagt 


gaggacaatg 


accctacaga


2220




tgctcagcta 


catccggccc 


agggtgctgc 


ggtggcacag 


accagccaca 


ggatggcagt 






agaataaaga 


cagtcgaaag 


ggatttctgc 


tcactggcag 


gagactgcga 


tgactgggag 


2340




aaaacctgca 


ctcggtggca 


cctacaacgt 


tgctaattta 


tttccttttg 


atargcaLt t 






atataggcaa 


ctcgatatag 


gatgggagca 


aactaggaat 


gaattgggta 


gctagactgt 






gcaggaattg 


ttggaacctg 


gagggaacaa 


taacagtacc 


tagcagattt 


ggcttcatct 


2520 




tccaggggcc 


ccacactccg 


tggtgagcca 


ccatcaatac 


agaaagtgac 


ctaagatgta 


2580 


ccagcaagat 


gccatccctt 


ctttttgtgt 


ggggtcatgg 


gctccaaaag 


ccaacgtgaa 


2640 




caattaaaaa 


tgtattgagc 


atctaaaaaa 


aaaaaaaaa 






2679 



<210> 11

<211> 1156 

<212> DNA 

<213> Homo sapiens



<400> 11 

cgcagcggag gtgaaggacg tccttcccca ggagccgact ggccaatcac aggcaggaag 60


atgaaggttc tgtgggctgc gttgctggtc acattcctgg caggatgcca ggccaaggtg 120






gagcaagcgg 


tggagacaga 


gccggagccc 


ggccagcgct 


gggaactggc 


actgggtcgc 


ctgtctgagc 


aggtgcagga 


ggagctgctc 


ctgatggacg 


agaccatgaa 


ggagttgaag 


accccggtgg 


cggaggagac 


gcgggcacgg 


cggctgggcg 


cggacatgga 


ggacgtgtgc 


caggccatgc 


tcggccagag 


caccgaggag 


aagctgcgta 


agcggctcct 


ccgcgatgcc 


caggccgggg 


cccgcgaggg 


cgccgagcgc 


cccctggtgg 


aacagggccg 


cgtgcgggcc 


ctacaggagc 


gggcccaggc 


ctggggcgag 


agccggaccc 


gcgaccgcct 


ggacgaggtg 


ctggaggagc 


aggcccagca 


gatacgcctg 


agctggttcg 


agcccctggt 


ggaagacatg 


gtgcaggctg 


ccgtgggcac 


cagcgccgcc 


aagcctgcag 


ccatgcgacc 


ccacgccacc 


cgggagaccc 


tgtccccgcc 


ccagccgtcc 


tcaccaagtt 


tcacgc 






<210> 12 

<211> 417 

<212> DNA

<213> Homo sapiens 



<400> 12 
acctcccaac 


caagccctcc 


agcaaggatt 


ctcttcctgt 


cgctcccggt 


cctggtggtg 


ccagcccagg 


ggaccccaga 


cgtctccagt 


acactggagg 


acaaggctcg 


ggaactcatc 


aagatgcggg 


agtggttttc 


agagacattt 


tcatgaggac 


ctgaagggtg 


acatccagga 


cctgtgctga 


ggactcccgc 


catgtggccc 



<210> 13 
<211> 753 
<212> DNA 
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gagctgcgcc 


agcagaccga 


gtggcagagc 


180


ttttgggatt


acctgcgctg 


ggtgcagaca 


240


agctcccagg 


tcacccagga 


actgagggcg 


300


gcctacaaat 


cggaactgga 


ggaacaactg 


360


ctgtccaagg 


agctgcaggc 


ggcgcaggcc 


420


ggccgcctgg 


tgcagtaccg 


cggcgaggtg 


480


ctgcgggtgc 


gcctcgcctc 


ccacctgcgc 


540 


gatgacctgc 


agaagcgcct 


ggcagtgtac 


600 


ggcctcagcg 


ccatccgcga 


gcgcctgggg 


660 


gccactgtgg 


gctccctggc 


cggccagccg 


720


cggctgcgcg 


cgcggatgga 


ggagatgggc 


780 


aaggagcagg 


tggcggaggt 


gcgcgccaag 


840 


caggccgagg 


ccttccaggc 


ccgcctcaag 


900 


cagcgccagt 


gggccgggct 


ggtggagaag 


960 


cctgtgccca 


gcgacaatca 


ctgaacgccg 


1020 


ccgtgcctcc 


tgcctccgcg 


cagcctgcag 


1080 


tcctggggtg 


gaccctagtt 


taataaagat 


1140 








1156 


caggagtgcc 


cctcgggcct 


cgccatgagg 


60 


gttctgtcga 


tcgtcttgga 


aggcccagcc 


120 


gccttggata 


agctgaagga 


gtttggaaac 


180 


agccgcatca 


aacagagtga 


actttctgcc 


240 


cagaaagtga 


aggagaaact 


caagattgac 


300 


ggggcctctg 


aaatttccca 


caccccagcg 


360 


caggtgccac 


caataaaaat 


cctaccg 


417 
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<400> 13 
gttgtggctg 


tggagcggaa 


gtgggtctca 


accactataa 


atcctctctg 


tgcccgtccg 


60 


gagctggtga 


ggacagcctg 


ccagagtctg 


gtctctggac 


actatgggca 


cacgactcct 


120 


cccagctctg 


tttcttgtcc 


tcctggtatt 


gggatttgag 


gtccagggga 


cccaacagcc 


180 


ccagcaagat 


gagatgccta 


gcccgacctt 


cctcacccag 


gtgaaggaat 


ctctctccag 


240 


ttactgggag 


tcagcaaaga 


cagccgccca 


gaacctgtac 


gagaagacat 


acctgcccgc 


300 


tgtagatgag 


aaactcaggg 


acttgtacag 


caaaagcaca 


gcagccatga 


gcacttacac 


360 


aggcattttt 


actgaccaag 


ttctttctgt 


gctgaaggga 


gaggagtaac 


agccagaccc 


420 


cccatcagtg 


gacaagggga 


gagtccccta 


ctcccctgat 


cccccaggtt 


cagactgagc 


480 


tcccccttcc 


cagtagctct 


tgcatcctcc 


tcccaactct 


agcctgaatt 


cttttcaata 


540 


aaaaatacaa 


ttcaagttgc 


ttctcatgga 


tggcactgct 


tttctgagga 


ctcaagggcc 


600 


aagatggagg 


ggctgactca 


gtccagccaa 


catttaatga 


gcacctactt 


tatgtatgga 


660 


gctctaaccc 


atgggtccat 


gggaataaag 


cagtgaatag 


taacaataaa 


taatcgtaac 


720 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaa 






753 
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so <211> 5950 

<212> DNA 

<213> Homo sapiens 
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<400> 14 
gaattccagg 


ggacatctgt 


tgcgcaccta 


ctgtgctcac 


gctggggcct 


ctgggatgga 


60 


caggattctg 


ccaaggcaga 


catctgggtc 


aagacagtcc 


tgcacagttg 


ttcaggttgt 


120 


ggccaaggtt 


gcgtttgcag 


atttgccatg 


taaaaataca 


ggatgctcag 


ttacatttga 


180 


atttcagatt 


aatagcaaaa 


aaaacttttt 


ggtataattc 


tgaaatattt 


catgggacat 


240 


atttatacta 


aaacgtcatg 


cactgttgat 


ttgaaattca 


aatgtaattg 


ggcctcctct 


300 


atttcgtctg 


gcaagcgtag 


acaaaagaat 


ccagtccagg 


ccaggcgcag 


tggctcaagc 


360 


ctgtaatccc 


agcactttgg 


gaggccgagg 


cgggcggatc 


acgaggtcag 


gagatcgaga 


420 


ccatcctggc 


taacacggtg 


aaaccccgtc 


tctactaaaa 


atacaaaaaa ttagctgggc 


480 


gtggtggcgg 


gtgcctgtag 


tcccagctac 


tcgggaggct 


gaggcaggag 


aatggcgtga 


540 


acctgggagg 


cggagcttgc 


agtgagccga 


gatcgcgcca 


cagcactcca 


gcctgggcga 


600 


cagagccaga 


ctctgtctca 


aaaaaaaaaa 


aaagaatcca 


gtccatagtc 


ccctgagcca 


660 


tgtgccctgg 


ggtgcagctg 


ggtccttcag 


gagaaaaatg 


ctctatttct 


ggcactggga 


720 
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ccgagcctga 


tgtgggtttt 


ttgttggttt 


ttgttgttgt 


tattgttttt 


gagacaaggt 


780 


ctcgctccac 


cacacccggc 


taatttttgt 


atttttagta 


gagacggggt 


ttcactacgt 


840


tggccaggct 


ggtcttgaac 


tcctgacctc 


aagtgagccg 


cctgcctcgg 


cctcccaaag 


900


tgctgggatt 


acaggtggga 


gccaccgccc 


tggccctggg 


cctgatgttg 


atgacctcct 


960


actatgtgca 


cctgcagctc 


tcctgcatag 


gcctcagccg 


tcctgcatga 


ggacactggg 


1020


aggcaggtgc 


tccctatcaa 


ccccgtgtta 


cagttgaaca 


aactgagccc 


cagaaaagaa


1080


aacgtatttg 


cccaggtcac 


acggtgaaga 


agtgagggat 


tcgaagccca 


ggtccatctg 


1140 


aagccagagt 


cacccagagg 


agaaagagtt 


ggaattgaga 


actcaaggaa 


tgcttggaag 


1200 


tgatcgggct 


cgagcccacc 


taggaagaaa 


cagaggctgg 


agacatgaga 


ctgtgttgct 


1260 


atttcctctc 


atcaaccctt 


gggccctatt 


gaggccctac 


cacaagcctg 


gccctgcagc 


1320


ccagtgacta 


ggagaaatta 


gacacaagat 


aataataaca 


gcaatgatct 


tttttttttt 


1380 


tctgagacgg 


agtcttgctc 


tttcgcccag 


gctggactgc 


agtggcgcga 


tctcggctca


1440


atgcaagctc 


cacctcccag 


gttcacgcca 


ttctcctgcc 


tcagcctccc 


gagtagctag 


1500 


gactacaggc 


gcctgccacc 


acgcctggct 


aatttttcat 


atttttagta 


gagatggggt 


1560 


ttcaccgtgt 


tagccaagat 


ggtctcaatc 


tcctgacctc 


gtgatccgcc 


tgcctcggcc 


1620 


tcccaaagtg 


ttggggttac 


aggcatgagc 


caccgcgcct 


ggccaacagc 


aatgatcttt 


1680


gagcacctat 


attgccagtc 


tccacggtaa 


gagctttctt 


cattttttgt 


tttgttttgt 


1740 


ttcaagacag 


agtcttgctc 


tgtcacccag 


gctggagtgc 


agtggtgtga 


tcgcggctca 


1800 


ctgcagcctt 


cacttcccgg 


gttcaagcca 


ttctcctgcc 


tcagcctccc 


aagtagctgg 


1860 


gattacaggc 


acgcatcact 


acttctggct 


aatttttgta 


tttttagtag 


ggacagggtt 


1920 


tttcaccatg 


ttggccaggt 


tggtctcaaa 


ctcctggcct 


catatgatct 


gcccacctcg 


1980 


gcctcccaaa 


gtgctgggat 


tacaggcgtg 


agccactgcg 


cctttctttg 


tatttgttca 


2040 


agtaatatac 


tgaaatatgt 


actgtgcctc 


ccactttatg 


gaggaggaaa 


ctgaggccag 


2100 


caaatgaggc 


tgtcatggga 


ggtggagaca 


ggatttgaac 


ctgcctcagt 


gcaggaggct 


2160 


caagagcctc 


tgtcttctct 


cagggcactg 


tgtgggaggg 


tgagaaggag 


ggaggcccac 


2220 


agaggcatga 


cctctgattg 


ccactgtcac 


ctgggccctg 


ctctctgaag 


tctctgccaa 


2280 


gcggggaggt 


ggccggggga 


gggccctgct 


ctgtgcagcc 


tcccctcccc 


cggcccgcag 


2340 


agttgagcac 


agagggacag 


aggcacggaa 


cccccagaaa 


tgtccctcct 


cagaaacagg 


2400 


ctccaggccc 


tgcctgccct 


gtgcctctgc 


gtgctggtcc 


tggcctgcat 


tgggggtgag 


2460 


aagaagtggg 


tggagggatg 


tggggcccac 


acctggtggg 


tgtgagtgtg 


gctgtgtgtc 


2520 


ctgtggctct 


gtagccacgt 


gagacatgag 


tacggagtgt 


gtgcgtttca 


tggcgtgcgt 


2580 


atgcatgtgc 


gtgtcgggga 


gtgtgtgtgt 


cggtggctga 


gagtgaagtg 


tgaatgtcac 


2640 


attggtacaa 


actgggatca 


tctgtgtgtg 


tgcacgtgcg 


tgcgtggaag 


tgggagtatg 


2700 


cagtcgtggt 


aaaaaagtgc 


atgtctgtgt 


gcatatgtgt 


atttgtgtgc 


acctgtctct 


2760 
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ctgtggggta 


tgtgtgtgca 


aaatatttga 


gtgtgtggac 


atgtgtgagg 


gggtgagtgt 


2820 




gtgctggtgt 


gtacgtctgt 


gttttgcata 


tgcatttttt 


tttttttttt 


ttagacggag 


2880 




tctcactctg 


tcacccaggc 


tggagtgcag 


tggtagcagt 


ggtgcgatct 


tggctcactg 


2940 




catcatccgc 


ctacccgttt 


caagggattc 


tcctgcctca 


gtcttcagag 


tatttgggac 


3000 




tacagacaca 


cgccaccatg 


cctggctaat 


tttttttttt 


tgagacggag 


tctcgctctg 


3060 




ttacccaggc 


tggagtgcag 


tggcgtgatc 


ttggctcact 


gcaagctccg 


cctcccgggt 


3120 




tcacgccatt 


ctcctgcctc 


agcctcccga 


gtagctggga 


ctacaggagc 


ccaccaccac 


3180 




gcctggctaa 


ttttttgtat 


ttttactaga 


gacggggttt 


cgccgtgtta 


gccaggatgg 


3240 




tctccatatc 


ctgacctcgt 


gatccgcctg 


cctcggcctt 


ccaaagtgct 


aggattatag 


3300 




gcgtgagcca 


ctgcgcctgg 


ccaatgcctg 


gctaattttt 


ttatattttt 


ggtagagaca 


3360 




gggttttgcc 


atgttgccca 


ggctggtctt 


gaaatcctga 


cctcaggtga 


tccgcccgcc 


3420 




ttggcctccc 


aaagtgctgg 


gattacaggc 


atgagccacc 


acgcccggcc 


atgtacttta 


3480 




tgttaaaatg 


ggatcatatt 


ctagatcagc 


attatccagt 


agaaatttaa 


atttttaata 


3540 




cagggccagg 


cacggtggct 


catgcctgta 


atcccagcac 


tttcggaggc 


cgaggcgggt 


3600 




ggatcgcaag 


gtcaggagat 


ttgagatcat 


cctggctaac 


agatgggtaa 


aaacccatct 


3660 




ctactaaaaa 


tacaaaaaat 


tagccatgca 


tggtggcatg 


cgcctgtagt 


cccagctact 


3720 




cgggaggctg 


aggccggaga 


atcacttgaa 


cccgggaggc 


agaggttgca 


gtgagccgag 


3780 




atcgcgccac 


tgcattccaa 


cctgggtgac 


agagcgagac 


tccgtctgaa 


aaaaaaaaaa 


3840 




aatttaacac 


gtatgtagac 


aatgtgcaag 


gcaccattcc 


atgtgcatcg 


tatgtagtaa 


3900 




ctcttaattc 


tcacgataac 


cctgaggtag 


atattattac 


cccgttctac 


aaaaggagaa 


3960 




acagtcctgg 


ggagacagga 


taagtcaccg 


gccaaggcac 


acagctagct 


acatgtggcc 


4020 




cccgcgtgac 


ggctggtctc 


tgtaggcgag 


gctttgtcca 


gatgcgtggg 


tagaaggtct 


4080 




ggcccggaaa 


gaggaactga 


cagcaaggct 


aagccaatgt 


ctgcccctgg 


gggcagaaag 


4140 




tcacctcctg 


ctctccctcc 


actgtccaca 


gaggtagctc 


agacagggtg 


ggggtcacag 


4200 




gagaacgaag 


ggagaagggg 


gtagttcctg 


ggcagcaaaa 


tcaggtggtg 


aagggaggca 


4260 




tcagaggatg 


gcaattagag 


aggccattag 


aggggaacca 


caggcagaca 


gggtgacagg 


4320 




agggactact 


gacacaaggt 


gaagagatgg 


cccagccgga 


cggggtggct 


cacatctgta 


4380 




atcccagcat 


tttgggagcc 


cgaggtgggt 


ggatcacttg 


aggtcaggag 


ttcgaggccc 


4440 




caacatggca 


aaaccccatc 


tcttctaaaa 


atacaaaaat 


tagccgggca 


tgatggcaga 


4500 




tgcctgtaat 


ccctgctact 


cgggaggctg 


aggcaggaaa 


attgcctgaa 


tccaggaggt 


4560 




ggaggttgca 


atgagacgag 


atcatgacac 


tgcactccac 


cctgggcaac 


agagcaagag 


4620 




actgactctg 


tctcataaaa 


aaaaagaaaa 


aagaaaaaaa 


aaaagagatg 


gctgatggtt 


4680 




aaagaggggt 


tagcggtcag 


gggacacata 


agggtaaagg 


caggaggcaa 


gaggactggc 


4740 




agggggctgc 


ccctgggcca 


ccgggagcga 


cacaggatga 


gcatggaggg 


aaagggagaa 


4800 
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ggggattcta 


gggtcccagc 


cracccaagt 


tgcccxc rgg 


x uccacciiag 


cargccagcc 






agaggcccag 


gaaggaaccc 


tgagcccccc 


accaaagcta 


aagatgagtc 


gctggagcct 


4920




ggtgaggggc 


aggatgaagg 


agctgctgga 


gacagtggtg 


aacaggacca 


gagacgggtg 


4980




gcaatggttc 


tggtgagggt 


gtgctggcct 


gggtggtggg 


aggggactcc 


tgggtctgag 


5040




ggaggagggg 


ctggggcctg 


gacccctgag 


tctcagggag


gaggaaaggg 


tgggagtggg 


5100




gctgtgaccc 


ctaggtctgg 


gaggagtgga 


gggttagagc 


tgagagcagg 


aactcctagg 


5160




tcacagagag 


gagcggataa 


atggggcaga 


gaacacctgg 


ggagagctgg 


ggcctccact 


5220




gtgatgtcct 


ctctcctgta 


ggagcccgag 


caccttccgg 


ggcttcatgc 


agacctacta 






tgacgaccac 


ctgagggacc 


tgggtccgct 


caccaaggcc 


tggttcctcg 


aatccaaaga 


5340




cagcctcttg 


aagaagaccc 


acagcctgtg 


ccccaggctt 


gtctgtgggg 


acaaggacca 


5400 




gggttaaaat 


gttcataaaa 


gccaggtgtg 


gttgtggcgg 


gtgcctgtag 


tcccagctac 


5460




tcaggaggct 


gaggtaggat 


gatggcttga 


gcccaggagt 


tcgagaccag 


cctgggcaac 


5520




acagcgagat 


ctcttggggg 


taaaacaaaa 


agaaaaaaaa 


aagttcatac 


ttctccaata 


5580




aataaagtct 


cacctgtgtc 


cctgtctgga 


tccttcccca 


gtgtggccag 


aaaaaaaccc 


5640




accccactgc 


ctcccaggaa 


tcaatgagta 


gaagaggtga 


cacctgatgg 


ggaaggaaga 


5700




gtagggaggt 


cgggaagggt 


atcaaggaat 


aacaccctat 


tgtgggcttg


cggagaatgg 


5760




gggacttcaa 


ggcgtgtcag


tttcaggagg 


gtgagggcag



gagcgtgggt 



ggagtcagca 


5820 




ggtccccatg 


atggccctca 


ctgagagctt 


cgcccttgtc 


tcctacaagc 


tctgactcca 


5880 




ttcccagtgg 


gcacccagca 


cctccaaccc 


ctccacagcc 


cccaacccag 


cctctgtcgg 


5940 




aggcgaattc 












5950 


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<211> 3549 
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40 
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45 


<400> 15 
cccctcttcc 


tcctcctcaa 


gggaaagctg 


cccac l Lcra 


gctgccctgc 


catccccttt


60




aaagggcgac 


ttgctcagcg 


ccaaaccgcg 


gctccagccc 


tctccagcct 


ccggctcagc 


120




cggctcatca 


gtcggtccgc 


gccttgcagc 


tcctccagag 


ggacgcgccc 


cgagatggag 


180




agcaaagccc 


tgctcgtgct 


gactctggcc 


gtgtggctcc 


agagtctgac 


cgcctcccgc 


240




ggaggggtgg 


ccgccgccga 


ccaaagaaga 


gattttatcg 


acatcgaaag 


taaatttgcc 


300 




ctaaggaccc 


ctgaagacac 


agctgaggac 


acttgccacc 


tcattcccgg 


agtagcagag 


360 




tccgtggcta cctgtcattt 


caatcacagc 


agcaaaacct 


tcatggtgat 


ccatggctgg 


420 


acggtaacag 


gaatgtatga 


gagttgggtg 


ccaaaacttg 


tggccgccct 


gtacaagaga 


480 
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gaaccagact 


ccaatgtcat 


tgtggtggac 


tggctgtcac 


gggctcagga 


gear raccca 


540


gtgtccgcgg 


gctacaccaa 


actggtggga 


caggatgtgg 


cccggtttat 


caactggatg 


600


gaggaggagt 


ttaactaccc 


tctggacaat 


gtccatctct 


tgggatacag 


ccttggagcc 




catgctgctg 


gcattgcagg 


aagtctgacc 


aataagaaag 


tcaacagaat 


tactggcctc 


720


gatccagctg 


gacctaactt 


tgagtatgca 


gaagccccga 


gtcgtctttc 


tcctgatgat 


780


gcagattttg 


tagacgtctt 


acacacattc 


accagagggt 


cccctggtcg 


aagcattgga 


840


atccagaaac 


cagttgggca 


tgttgacatt


tacccgaatg 


gaggtacttt 


tcagccagga 


900


tgtaacattg 


gagaagctat 


ccgcgtgatt 


gcagagagag 


gacttggaga 


tgtggaccag 


960


ctagtgaagt 


gctcccacga 


gcgctccatt 


catctcttca 


tcgactctct 


gttgaatgaa 


1020


gaaaatccaa 


gtaaggccta 


caggtgcagt 


tccaaggaag 


cctttgagaa 


agggctctgc 


1080


ttgagttgta 


gaaagaaccg 


ctgcaacaat 


ctgggctatg 


agatcaataa 


agtcagagcc 


1140


aaaagaagca 


gcaaaatgta 


cctgaagact 


cgttctcaga 


tgccctacaa 


agtcttccat 


1200


taccaagtaa 


agattcattt 


ttctgggact 


gagagtgaaa 


cccataccaa 


tcaggccttt 


1260


gagatttctc 


tgtatggcac 


cgtggccgag 


agtgagaaca 


tcccattcac 


tctgcctgaa 




gtttccacaa 


ataagaccta 


ctccttccta


atttacacag 


aggtagatat 


tggagaacta 


1380


ctcatgttga 


agctcaaatg 


gaagagtgat 


tcatacttta


gctggtcaga 


ctggtggagc 


1440


agtcccggct 


tcgccattca 


gaagatcaga 


gtaaaagcag 


gagagactca 


gaaaaaggtg 


1500


atcttctgtt 


ctagggagaa 


agtgtctcat 


ttgcagaaag 


gaaaggcacc 


tgcggtattt 


1560


gtgaaatgcc 


atgacaagtc 


tctgaataag 


aagtcaggct 


gaaactgggc 


gaatctacag 


1620


aacaaagaac 


ggcatgtgaa 


ttctgtgaag 


aatgaagtgg 


aggaagtaac 


ttttacaaaa 


1680


catacccagt 


gtttggggtg 


tttcaaaagt 


ggattttcct 


gaatattaat 


cccagcccta 


1740


cccttgttag 


ttattttagg 


agacagtctc 


aagcactaaa 


aagtggctaa 



ttcaatttat 


1800


ggggtatagt 


ggccaaatag 


cacatcctcc 


aacgttaaaa 


gacagtggat 


catgaaaagt 


1860


gctgttttgt 


cctttgagaa 


agaaataatt 


gtttgagcgc 


agagtaaaat 


aaggctcctt 


1920


catgtggcgt 


attgggccat 


agcctataat 


tggttagaac 


ctcctatttt 


aattggaatt 


1980


ctggatcttt


cggactgagg 


ccttctcaaa 


ctttactcta 


agtctccaag 


aatacagaaa


2040


atgcttttcc 


gcggcacgaa 


tcagactcat 


ctacacagca 


gtatgaatga 


tgttttagaa


2100


tgattccctc 


ttgctattgg 


aatgtggtcc 


agacgtcaac 


caggaacatg 


taacttggag 


2160


agggacgaag 


aaagggtctg 


ataaacacag 


aggttttaaa 


cagtccctac 


cattggcctg 


2220


catcatgaca 


aagttacaaa 


ttcaaggaga 


tataaaatct 


agatcaatta 


attcttaata 


2280


ggctttatcg 


tttattgctt 


aatccctctc 


tcccccttct 


tttttgtctc 


aagattatat 


2340 


tataataatg 


ttctctgggt 


aggtgttgaa 


aatgagcctg 


taatcctcag 


ctgacacata 


2400 


atttgaatgg 


tgcagaaaaa 


aaaaagatac 


cgtaatttta 


ttattagatt 


ctccaaatga 


2460 


ttttcatcaa 


tttaaaatca 


ttcaatatct 


gacagttact 


cttcagtttt 


aggcttacct 


2520 
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tggtcatgct 


tcagttgtac 


ttccagtgcg 


tctcttttgt 


tcctggcttt 


gacatgaaaa 


2580 




gataggtttg 


agttcaaatt 


ttgcattgtg 


tgagcttcta 


cagattttag 


acaaggaccg 


2640 




tttttactaa 


gtaaaagggt 


ggagaggttc 


ctggggtgga 


ttcctaagca 


gtgcttgtaa 


2700 




accatcgcgt 


gcaatgagcc 


agatggagta 


ccatgagggt 


tgttatttgt 


tgtttttaac 


2760 




aactaatcaa 


gagtgagtga 


acaactattt 


ataaactaga 


tctcctattt 


ttcagaatgc 


2820 




tcttctacgt 


ataaatatga 


aatgataaag 


atgtcaaata 


tctcagaggc 


tatagctggg 


2880 




aacccgactg 


tgaaagtatg 


tgatatctga 


acacatacta 


gaaagctctg 


catgtgtgtt 


2940 




gtccttcagc 


ataattcgga 


agggaaaaca 


gtcgatcaag 


ggatgtattg 


gaacatgtcg 


3000 




gagtagaaat 


tgttcctgat 


gtgccagaac 


ttcgaccctt 


tctctgagag 


agatgatcgt 


3060 




gcctataaat 


agtaggacca 


atgttgtgat 


taacatcatc 


aggcttggaa 


tgaattctct 


3120 




ctaaaaataa 


aatgatgtat 


gatttgttgt 


tggcatcccc 


tttattaatt 


cattaaattt 


3180 




ctggatttgg 


gttgtgaccc 


agggtgcatt 


aacttaaaag 


attcactaaa 


gcagcacata 


3240 




gcactgggaa 


ctctggctcc 


gaaaaacttt 


gttatatata 


tcaaggatgt 


tctggcttta 


3300 




cattttattt 


attagctgta 


aatacatgtg 


tggatgtgta 


aatggagctt 


gtacatattg 


3360 




gaaaggtcat 


tgtggctatc 


tgcatttata 


aatgtgtggt 


gctaactgta 


tgtgtcttta 


3420 




tcagtgatgg 


tctcacagag 


ccaactcact 


cttatgaaat 


gggctttaac 


aaaacaagaa 


3480 




agaaacgtac 


ttaactgtgt 


gaagaaatgg 


aatcagcttt 


taataaaatt 


gacaacattt 


3540 




tattaccac 












3549 



<210> 16 

<211> 1790 

<212> DNA 

<213> Homo sapiens 



<400> 16 
gtgaatctct 


ggggccagga 


agaccctgct 


gcccggaaga 


gcctcatgtt 


ccgtgggggc 


60 


tgggcggaca 


tacatatacg 


ggctccaggc 


tgaacggctc 


gggccactta 


cacaccactg 


120 


cctgataacc 


atgctggctg 


ccacagtcct 


gaccctggcc 


ctgctgggca 


atgcccatgc 


180 


ctgctccaaa 


ggcacctcgc 


acgaggcagg 


catcgtgtgc 


cgcatcacca 


agcctgccct 


240 


cctggtgttg 


aaccacgaga 


ctgccaaggt 


gatccagacc 


gccttccagc 


gagccagcta 


300 


cccagatatc 


acgggcgaga 


aggccatgat 


gctccttggc 


caagtcaagt 


atgggttgca 


360 


caacatccag 


atcagccact 


tgtccatcgc 


cagcagccag 


gtggagctgg 


tggaagccaa 


420 


gtccattgat 


gtctccattc 


agaacgtgtc 


tgtggtcttc 


aaggggaccc 


tgaagtatgg 


480 


ctacaccact 


gcctggtggc 


tgggtattga 


tcagtccatt 


gacttcgaga 


tcgactctgc 


540 


cattgacctc 


cagatcaaca 


cacagctgac 


ctgtgactct 


ggtagagtgc 


ggaccgatgc 


600 
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ccctgactgc 


tacctgtctt 


tccataagct 




tgggtggatc 


aagcagctgt 


tcacaaattt 




gggacagatc 


tgcaaagaga 


tcaacgtcat 




aagggctgcc 


agcatccttt 


cagatggaga 




tcccgtcatc 


acagcctcct 


acctggagtc 




tgtctcagag 


gacctccccc 


tccccacctt 




gctgtacttc 


tggttctctg 


agcgagtctt 




tggccgcctc 


atgctcagcc 


tgatgggaga 




cttcaacacc 


aaccaggaaa 


tcttccaaga 




agtcaccgtc 


cactgcctca 


agatgcccaa 




caattcttca 


gtgatggtga 


aattcctctt 




ttacacattt 


gaagaggata 


tcgtgactac 




cttcttaagc 


ctcttggatt 


tccagattac 




cagctccgag 


tccatccaga 


gcttcctgca 




ggtcatgtct 


cggctcgagg 


tagtgtttac 




cttcgacatc 


atcaaccctg 


agattatcac 




ctttnarttr 




ty l iy y iy ya 








l y Luy^.u^aa 




ggtgtctcct 


ccagcgtggt 


ggaagttggg 




aactcctccc 


tatcctaaag 


gcccactggc 




<210> 17 







<211> 2688 

<212> DNA 

<213> Homo sapiens 



<400> 17 

tttaaagctg ggaggttctg ccaccaagca 
ggcgggaaga gcccggaaag aaacctgtgg 
aaggtgacct cacattcgtg ccccttagca 
aaacggcctg aacctctcgg ccaaagttgt 
agtgagggag tttctcgaga ataacgctga 
tgacggctct gaggaggaga atgggcggct 
caggcggctg aagaagtatg acaactgctg 
caggatcgaa agcaagacgg ttatcgtcac 



gctcctgcat 


ctccaagggg 


agcgagagcc 


660 


catctccttc 


accctgaagc 


tggtcctgaa


720


ctctaacatc 


atggccgatt 


ttgtccagac 


780


cattggggtg 


gacatttccc 


tgacaggtga 


840 


ccatcacaag 


ggtcatttca 


tctacaagaa 


900 


ctcgcccaca 


ctgctggggg 


actcccgcat 


960 


ccactcgctg 


gccaaggtag 


ctttccagga 


1020


cgagttcaag 


gcagtgctgg 


agacctgggg 


1080 


ggttgtcggc 


ggcttcccca 


gccaggccca 


1140


gatctcctgc 


caaaacaagg 


gagtcgtggt 


1200 


tccacgccca 


gaccagcaac 


attctgtagc 


1260 


cgtccaggcc 


tcctattcta 


agaaaaagct 


1320 


accaaagact 


gtttccaact 


tgactgagag 


1380 


gtcaatgatc 


accgctgtgg 


gcatccctga 


1440 


agccctcatg 


aacagcaaag 


gcgtgagcct 


1500 


tcgagatggc 


ttcctgctgc 


tgcagatgga 


1560 


tttcctccag 


agcttgagct 


agaagtctcc 


1620 


ggcaagcacc 


aggctcacag 


ctggaaccct 


1680 


ttaggagtac 


ggagatggag 


attggctccc 


1740 


attaaagtgc 


tgtatccaag 




1790 



cggccttccc 


actgggaaca 


caaacttgct 


60 


atctcccttc 


gagatcatcc 


aaagagaaga 


120 


gcactctgca 


gaaatgcctc 


ctcagctgca 


180 


ccagggaagc 


ctggacagcc 


tgccccaggc 


240 


gctgtgtcag 


cctgatcaca 


tccacatctg 


300 


tctgggccag 


atggaggaag 


agggcatcct 


360 


gttggctctc 


actgacccca 


gggatgtggc 


420 


ccaagagcaa 


agagacacag 


tgcccatccc 


480 
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caaaacaggc 


ctcagccagc 


tcggtcgctg 


gatgtcagag 


gaggattttg 


agaaagcgtt 


540 




caatgccagg 


ttcccagggt 


gcatgaaagg 


tcgcaccatg 


tacgtcatcc 


cattcagcat 


600 




ggggccgctg 


ggctcgcctc 


tgtcaaagat 


cggcatcgag 


ctgacggatt 


caccctacgt 


660 




ggtggccagc 


atgcggatca 


tgacgcggat 


gggcacgccc 


gtcctggaag 


cactgggcga 


720 




tggggagttt 


gtcaaatgcc 


tccattctgt 


ggggtgccct 


ctgcctttac 


aaaagccttt 


780 




ggtcaacaac 


tggccctgca 


acccggagct 


gacgctcatc 


gcccacctgc 


ctgaccgcag 


840 




agagatcatc 


tcctttggca 


gtgggtacgg 


cgggaactcg 


ctgctcggga 


agaagtgctt 


900 




tgctctcagg 


atggccagcc 


ggctggccaa 


ggaggaaggg 


tggctggcag 


agcacatgct 


960 




ggttctgggt 


ataaccaacc 


ctgagggtga 


gaagaagtac 


ctggcggccg 


catttcccag 


1020 




cgcctgcggg 


aagaccaacc 


tggccatgat 


gaaccccagc 


ctccccgggt 


ggaaggttga 


1080 




gtgcgtcggg 


gatgacattg 


cctggatgaa 


gtttgacgca 


caaggtcatt 


taagggccat 


1140 




caacccagaa 


aatggctttt 


tcggtgtcgc 


tcctgggact 


tcagtgaaga 


ccaaccccaa 


1200 




tgccatcaag 


accatccaga 


agaacacaat 


ctttaccaat 


gtggccgaga 


ccagcgacgg 


1260 




gggcgtttac 


tgggaaggca 


ttgatgagcc 


gctagcttca 


ggtgtcacca 


tcacgtcctg 


1320 




gaagaataag 


gagtggagct 


cagaggatgg 


ggaaccttgt 


gcccacccca 


actcgaggtt 


1380 




ctgcacccct 


gccagccagt 


gccccatcat 


tgatgctgcc 


tgggagtctc 


cggaaggtgt 


1440 




tcccattgaa 


ggcattatct 


ttggaggccg 


tagacctgct 


ggtgtccctc 


tagtctatga 


1500 




agctctcagc 


tggcaacatg 


gagtctttgt 


gggggcggcc 


atgagatcag 


aggccacagc 


1560 




ggctgcagaa 


cataaaggca 


aaatcatcat 


gcatgacccc 


tttgccatgc 


ggcccttctt 


1620 




tggctacaac 


ttcggcaaat 


acctggccca 


ctggcttagc 


atggcccagc 


acccagcagc 


1680 




caaactgccc 


aagatcttcc 


atgtcaactg 


gttccggaag 


gacaaggaag 


gcaaattcct 


1740 




ctggccaggc 


tttggagaga 


actccagggt 


gctggagtgg 


atgttcaacc 


ggatcgatgg 


1800 




aaaagccagc 


accaagctca 


cgcccatagg 


ctacatcccc 


aaggaggatg 


ccctgaacct 


1860 




gaaaggcctg 


gggcacatca 


acatgatgga 


gcttttcagc 


atctccaagg 


aattctggga 


1920 




gaaggaggtg 


gaagacatcg 


agaagtatct 


ggaggatcaa 


gtcaatgccg 


acctcccctg 


1980 




tgaaatcgag 


agagagatcc 


ttgccttgaa 


gcaaagaata 


agccagatgt 


aatcagggcc 


2040 




tgagtgcttt 


acctttaaaa 


tcattccctt 


tcccatccat 


aaggtgcagt 


aggagcaaga 


2100 




gagggcaagt 


gttcccaaat 


tgacgccacc 


ataataatca 


tcaccacacc 


gtgagcagat 


2160 




ctgaaaggca 


cactttgatt 


tttttaagga 


taagaaccac 


agaacactgg 


gtagtagcta 


2220 




atgaaattga 


gaagggaaat 


cttagcatgc 


ctccaaaaat 


tcacatccaa 


tgcatagttt 


2280 




gttcaaattt 


aaggttactc 


aggcattgat 


cttttcagtg 


ttttttcact 


ttagctatgt 


2340 




ggattagcta 


gaatgcacac 


caaaaaaata 


cttgagctgt 


atatatatat 


gtgtgtgtgt 


2400 




gtgtgtgtgt 


gtgtgtgtgt 


gtgtgcatgt 


atgtgcacat 


gtgtctgtgt 


ggtatatttg 


2460 




tgtatgtgta 


tttgtatgta 


ctgttattga 


aaatatattt 


aatacctttg 


gaaaaatctt 


2520 
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gggcaagatg 


acctactagt 


tttccttgaa 


aaaaagttgc 


tttgttatta 


atattgtgct 


2580 




taaattattt 


ttatacacca 


ttgttcctta 


cctttacata 


attgcaatat


ttccccctta 


2640 




ctacttcttg 
<210> 18 


gaaaaaaatt 


acaaaatgaa 


gttttataga 


aaagatgg 




2688 




<211> 3095 














<212> DNA 
















<213> Homo sapiens 














<400> 18 
ragcagagca 


a rcaccacca 


agcctggaat 


aactgcaagg


gctctgctga


catcttcctg 


60 




agg rgccaag 


gaaa rgagga 


tggaggaagg 


aatgaatgtt 


ctccatgact 


ttgggatcca 


120 




y LCddCdCd L 


race xccagg 


tgaattacca 


agactcccag 


gactggttca 


tcttggtgtc 


180 




cgiga rcgca 


gacc tcagga 


atgccttcta


cgtcctcttc 


cccatctggt 


tccatcttca 


240 




ggaagcrg xg 


ggcax uaaac 


tcctttgggt 


agctgtgatt 


ggagactggc 


tcaacctcgt 


300 




ctttaagtgg 


diLCicitig 


gacagcgtcc 


atactggtgg 


gttttggata 


ctgactacta 


360 




cagcaacac l 


f f r\ + r\ f f r- r 

cccgTgcccc 


tgataaagca 


gttccctgta 


acctgtgaga 


ctggaccagg 


420 




gagcccc lc l 


ggeca igcca 


tgggcacagc 


aggtgtatac 


tacgtgatgg 


tcacatctac 


480 




tCIIICCdLC 


l l rcagggaa 


agataaagcc


gacctacaga 


tttcggtgct 


tgaatgtcat 


540 




l l ug xgg l tg 


gga llc uggg 


ctgtgcagct 


gaatgtctgt 


ctgtcacgaa 


tctaccttgc 


600 




-i-^*--*-/ — , -•- -»- -*- +- 

tyC lCSllll 


ccica Lcaag 


ttgttgctgg 


agtcctgtca 


ggcattgctg 


ttacagaaac 


660 




n rcagccac 


a rccacagca 


tctataatgc


cagcctcaag 


aaatattttc 


tcattacctt 


720 




/- 1- 1- r t- 1- n 1- 1- 


agcLLcgcca 


tcggatttta


tctgctgctc 


aagggactgg 


gtgtagacct 


780 




cctgtggact 


ctggagaaag 


cccagaggtg 


gtgcgagcag 


ccagaatggg 


tccacattga 


840 




caccacaccc 


tttgccagcc 


tcctcaagaa 


cctgggcacg 


ctctttggcc 


tggggctggc 


900 




tctcaactcc 


agcatgtaca 


gggagagctg 


caaggggaaa 


ctcagcaagt 


ggctcccatt 


960 




ccgcctcagc 


tctattgtag 


cctccctcgt 


cctcctgcac 


gtctttgact 


ccttgaaacc 


1020 


45 


cccatcccaa 


gtcgagctgg 


tettctaegt 


cttgtccttc 


tgcaagagtg 


eggtagtgee 


1080 




cctggcatcc 


gtcagtgtca 


tcccctactg 


cctcgcccag 


gtcctgggcc 


agccgcacaa 


1140 




gaagtcgttg 


taagagatgt 


ggagtcttcg 


gtgtttaaag 


tcaacaacca 


tgecagggat 


1200 


50 


tgaggaggac 


tactatttga 


agcaatgggc 


actggtattt 


ggagcaagtg 


acatgccatc 


1260 




cattctgccg 


tcgtggaatt 


aaatcacgga 


tggcagattg 


gagggtcgee 


tggcttattc 


1320 




ccatgtgtga 


ctccagcctg 


ccctcagcac 


agactctttc 


agatggaggt 


gccatatcac 


1380 


55 


gtacaccata 


tgcaagtttc 


ccgccaggag 


gtcctcctct 


ctctacttga 


atactctcac 


1440 




aagtagggag 


ctcactccca 


ctggaacagc 


ccattttatc 


tttgaatggt 


cttctgccag 


1500 
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cccattttga 


ggccagaggt 


gctgtcagct 


caggtggtcc 


tcttttacaa 


tcctaatcat 


IdoU 


attgggtaat 


gtttttgaaa 


agctaatgaa 


gctattgaga 


aagacctgtt 


gctagaagtt 


1 OA 


gggttgttct 


ggattttccc 


ctgaagactt 


act tattctt 


ccgtcacata 


tacaaaagca 


lboU 


agacttccag 


gtagggccag 


ctcacaagcc 


caggctggag 


atcctaactg 


agaat tttct 


1/4U 


acctgtgttc 


attcttaccg 


agaaaaggag 


aaaggagctc 


tgaatctgat 


aggaaaagaa 


loUU 


ggctgcctaa 


ggaggagttt 


ttagtatgtg 


gcgtatcatg 


caagtgctat 


gccaagccat 


1860 


gtctaaatgg 


ctttaattat 


atagtaatgc 


actctcagta 


atgggggacc 


agcttaagta 


1920 


taattaatag 


atggttagtg 


gggtaattct 


gcttctagta 


ttttttttac 


tgtgcataca 


1980 


tgttcatcgt 


atttccttgg 


atttctgaat 


ggctgcagtg 


acccagatat 


tgcactaggt 


ZU40 


caaaacattc 


aggtatagct 


gacatctcct 


ctatcacatt 


acatcatcct 


ccttataagc 


2100 


ccagctctgc 


tttttccaga 


ttcttccact 


ggctccacat 


ccaccccact 


ggatcttcag 


2160 


aaggctagag 


ggcgactctg 


gtggtgcttt 


tgtatgtttc 


aattaggctc 


tgaaatcttg 


2220 


ggcaaaatga 


caaggggagg 


gccaggattc 


ctctctcagg 


tcactccagt 


gttactttta 


2280 


attcctagag 


ggtaaatatg 


actcctttct 


ctatcccaag 


ccaaccaaga 


gcacattctt 


2340 


aaaggaaaag 


tcaacatctt 


ctctcttttt 


tttttttttt 


gagacagggt 


ctcactatgt 


2400 


tgcccaggct 


gctcttgaat 


tcctgggctc 


aagcagtcct 


cccaccctac 


cacagcgtcc 


2460 


cgcgtagctg 


gcatacaggt 


gcaagccact 


atgtccagct 


agccaactcc 


tccttgcctg 


2520 


cttttctttt 


tttttctttt 


tttgagacgg 


cgcacctatc 


acccaggctg 


gagtggagtg 


2580 


gcacgatctt 


ggctcactgc 


aacctcttcc 


tcctggttca 


agcgattctc 


atgtctcagc 


2640 


ctcctcagta 


gctaggacta 


ccggcgtgca 


ccaccatgcc 


aggctaattt 


ttatattttt 


2700 


agaattttag 


aagagatggg 


atttcatcat 


gttggccagg 


ctggtctcga 


actcctgacc 


2760 


tcaagtgatc 


cacctgcctt 


ggcctcccaa 


ggtgctagga ttacaggcat gagccaccgc 


2820 


accgggccct 


ccttgcctgt 


ttttcaatct 


catctgatat 


gcagagtatt 


tctgccccac 


2580 


ccacctaccc 


cccaaaaaaa 


gctgaagcct 


atttatttga aagtccttgt 


ttttgctact 


2940 


aattatatag 


tataccatac 


attatcattc 


aaaacaacca 


tcctgctcat 


aacatctttg 


3000 


aaaagaaaaa 


tatatatgtg 


cagtatttta 


ttaaagcaac attttattta agaataaagt 


3060 


cttgttaatt 


actatatttt 


agatgcaatg 


tgatc 






3095 



<210> 19 

<211> 2128 

<212> DNA 

<213> Homo sapiens 

<400> 19 

gggggtccca tcgggcccgc cctcgcacgt cactccggga cccccgcggc ctccgcaggt 60
tctgcgctcc aggccggagt cagagactcc aggatcggtt ctttcatCLt cgccgcccct 120
gcgcgtccag ctcttctaag acgagatgcc gtcgggcttc caacagatag gctccgaaga 180
tggggaaccc cctcagcagc gagtgactgg gaccctggtc cttgctgtgt tctctgcggt 240
gcttggctcc ctgcagtttg ggtacaacat tggggtcatc aatgcccctc agaaggtgat 300
tgaacagagc tacaatgaga cgtggctggg gaggcagggg cctgagggac ccagctccat 360
ccctccaggc accctcacca ccctctgggc cctctccgtg gccatctttt ccgtgggcgg 420
catgatttcc tccttcctca ttggtatcat ctctcagtgg cttggaagga aaagggccat 480
gctggtcaac aatgtcctgg cggtgctggg gggcagcctc atgggcctgg ccaacgctgc 540
tgcctcctat gaaatgctca tccttggacg attcctcatt ggcgcctact cagggctgac 600
atcagggctg gtgcccatgt acgtggggga gattgctccc actcacctgc ggggcgccct 660
ggggacgctc aaccaactgg ccattgttat cggcattctg atcgcccagg tgctgggctt 720
ggagtccctc ctgggcactg ccagcctgtg gccactgctc ctgggcctca cagtgctacc 780
tgccctcctg cagctggtcc tgctgccctt ctgtcccgag agcccccgct acctctacat 840
catccagaat ctcgaggggc ctgccagaaa gagtctgaag cgcctgacag gctgggccga 900
tgtttctgga gtgctggctg agctgaagga tgagaagcgg aagctggagc gtgagcggcc 960
actgtccctg ctccagctcc tgggcagccg tacccaccgg cagcccctga tcattgcggt 1020
cgtgctgcag ctgagccagc agctctctgg catcaatgct gttttctatt attcgaccag 1080
catcttcgag acagcagggg taggccagcc tgcctatgcc accataggag ctggtgtggt 1140
caacacagtc ttcaccttgg tctcggtgtt gttggtggag cgggcggggc gccggacgct 1200
ccatctcctg ggcctggcgg gcatgtgtgg ctgtgccatc ctgatgactg tggctctgct 1260
cctgctggag cgagttccag ccatgagcta cgtctccatt gtggccatct ttggcttcgt 1320
ggcatttttt gagattggcc ctggccccat tccttggttc atcgtggccg agctcttcag 1380
ccagggaccc cgcccggcag ccatggctgt ggctggtttc tccaactgga cgagcaactt 1440
catcattggc atgggtttcc agtatgttgc ggaggctatg gggccctacg tcttccttct 1500
atttgcggtc ctcctgctgg gcttcttcat cttcaccttc ttaagagtac ctgaaactcg 1560
aggccggacg tttgaccaga tctcagctgc cttccaccgg acaccctctc ttttagagca 1620
ggaggtgaaa cccagcacag aacttgagta tttagggcca gatgagaacg actgaggggc 1680
caggcagggg tgggagagcc agctctctct acccggccca gagacccctt cctttcctct 1740
gcagcacttt aaccctctct tccctattat ttccgggtgg aaaagaatcc ctgcagcctg 1800
gtagaattgg gaagctgggg gaagggtggt ctgagcaccc cctcattccc ctcgtgtgac 1860
tctcttggat tatttatgtg ttgtggtttg gccgtggcca tcagggtggg ccactctccc 1920
ctccctcttc cttcccccat cccctttcct ccccaccttc cccagactca gctccagaat 1980
accttcttcg ctgctagaga agggggattg gagggaagac aggtctagac tttctcagtg 2040
ggacaaacca gagcagagag caggacagga gacaagaaat ccagtttccc accaccttgg 2100
actcctccca caatctggga ctttcact 2128

<210> 20 

<211> 1496 

<212> DNA 

<213> Homo sapiens 



<400> 20 
ggcacgagga aaatcaagat aaaaatgttc acaattaagc tccttctttt tattgttcct 60
ctagttattt cctccagaat tgatcaagac aattcatcat ttgattctct atctccagag 120
ccaaaatcaa gatttgctat gttagacgat gtaaaaattt tagccaatgg cctccttcag 180
ttgggacatg gtcttaaaga ctttgtccat aagacgaagg gccaaattaa tgacatattt 240
caaaaactca acatatttga tcagtctttt tatgatctat cgctgcaaac cagtgaaatc 300
aaagaagaag aaaaggaact gagaagaact acatataaac tacaagtcaa aaatgaagag 360
gtaaagaata tgtcacttga actcaactca aaacttgaaa gcctcctaga agaaaaaatt 420
ctacttcaac aaaaagtgaa atatttagaa gagcaactaa ctaacttaat tcaaaatcaa 480
cctgaaactc cagaacaccc agaagtaact tcacttaaaa cttttgtaga aaaacaagat 540
aatagcatca aagaccttct ccagaccgtg gaagaccaat ataaacaatt aaaccaacag 600
catagtcaaa taaaagaaat agaaaatcag ctcagaagga ctagtattca agaacccaca 660
gaaatttctc tatcttccaa gccaagagca ccaagaacta ctccctttct tcagttgaat 720
gaaataagaa atgtaaaaca tgatggcatt cctgctgaat gtaccaccat ttataacaga 780
ggtgaacata caagtggcat gtatgccatc agacccagca actctcaagt ttttcatgtc 840
tactgtgatg ttatatcagg tagtccatgg acattaattc aacatcgaat agatggatca 900
caaaacttca atgaaacgtg ggagaactac aaatatggtt ttgggaggct tgatggagaa 960
ttttggttgg gcctagagaa gatatactcc atagtgaagc aatctaatta tgttttacga 1020
attgagttgg aagactggaa agacaacaaa cattatattg aatattcttt ttacttggga 1080
aatcacgaaa ccaactatac gctacatcta gttgcgatta ctggcaatgt ccccaatgca 1140
atcccggaaa acaaagattt ggtgttttct acttgggatc acaaagcaaa aggacacttc 1200
aactgtccag agggttattc aggaggctgg tggtggcatg atgagtgtgg agaaaacaac 1260
ctaaatggta aatataacaa accaagagca aaatctaagc cagagaggag aagaggatta 1320
tcttggaagt ctcaaaatgg aaggttatac tctataaaat caaccaaaat gttgatccat 1380
ccaacagatt cagaaagctt tgaatgaact gaggcaaatt taaaaggcaa taatttaaac 1440
attaacctca ttccaagtta atgtggtcta ataatctggt attaaatcct taagag 1496



<210> 21

<211> 1415

<212> DNA

<213> Homo sapiens

<400> 21

ggaggaggga gagagagaga agagaagaaa aagaaaaaag aacatcaata aaaagaagtc 60
agatttgttc gaaatcttga ggagtcttca ggccagctcc ctgtcggatg gcttttatga 120
aaaaatatct cctccccatt ctggggctct tcatggccta ctactactat tctgcaaacg 180
aggaattcag accagagatg ctccaaggaa agaaagtgat tgtcacaggg gccagcaaag 240
ggatcggaag agagatggct tatcatctgg cgaagatggg agcccatgtg gtggtgacag 300
cgaggtcaaa agaaactcta cagaaggtgg tatcccactg cctggagctt ggagcagcct 360
cagcacacta cattgctggc accatggaag acatgacctt cgcagagcaa tttgttgccc 420
aagcaggaaa gctcatggga ggactagaca tgctcattct caaccacatc accaacactt 480
ctttgaatct ttttcatgat gatattcacc atgtgcgcaa aagcatggaa gtcaacttcc 540
tcagttacgt ggtcctgact gtagctgcct tgcccatgct gaagcagagc aatggaagca 600
ttgttgtcgt ctcctctctg gctgggaaag tggcttatcc aatggttgct gcctattctg 660
caagcaagtt tgctttggat gggttcttct cctccatcag aaaggaatat tcagtgtcca 720
gggtcaatgt atcaatcact ctctgtgttc ttggcctcat agacacagaa acagccatga 780
aggcagtttc tgggatagtc catatgcaag cagctccaaa ggaggaatgt gccctggaga 840
tcatcaaagg gggagctctg cgccaagaag aagtgtatta tgacagctca ctctggacca 900
ctcttctgat cagaaatcca tgcaggaaga tcctggaatt tctctactca acgagctata 960
atatggacag attcataaac aagtaggaac tccctgaggg ctgggcatgc tgagggattt 1020
tgggactgtt ctgtctcatg tttatctgag ctcttatcta tgaagacatc ttcccagagt 1080
gtccccagag acatgcaagt catgggtcac acctgacaaa tggaaggagt tcctctaaca 1140
tttgcaaaat ggaaatgtaa taataatgaa tgtcatgcac cgctgcagcc agcagttgta 1200
aaattgttag taaacatagg tataattacc agatagttat attaaattta tatcttatat 1260
ataataatat gtgatgatta atacaatatt aattataata aaggtcacat aaactttata 1320
aattcataac tggtagctat aacttgagct tattcaggat ggtttcttta aaaccataaa 1380
ctgtacaaat gaaatttttc aatatttgtt tctta 1415

<210> 22

<211> 1405

<212> DNA

<213> Homo sapiens

<400> 22



EP 1 398 032 A1 





acaattcaga ggctgctgcc tgcttaggag gttgtagaaa gctctgtagg ttctctctgt 60
gtgtcctaca ggagtcttca ggccagctcc ctgtcggatg gcttttatga aaaaatatct 120
cctccccatt ctggggctct tcatggccta ctactactat tctgcaaacg aggaattcag 180
accagagatg ctccaaggaa agaaagtgat tgtcacaggg gccagcaaag ggatcggaag 240
agagatggct tatcatctgg cgaagatggg agcccatgtg gtggtgacag cgaggtcaaa 300
agaaactcta cagaaggtgg tatcccactg cctggagctt ggagcagcct cagcacacta 360
cattgctggc accatggaag acatgacctt cgcagagcaa tttgttgccc aagcaggaaa 420
gctcatggga ggactagaca tgctcattct caaccacatc accaacactt ctttgaatct 480
ttttcatgat gatattcacc atgtgcgcaa aagcatggaa gtcaacttcc tcagttacgt 540
ggtcctgact gtagctgcct tgcccatgct gaagcagagc aatggaagca ttgttgtcgt 600
ctcctctctg gctgggaaag tggcttatcc aatggttgct gcctattctg caagcaagtt 660
tgctttggat gggttcttct cctccatcag aaaggaatat tcagtgtcca gggtcaatgt 720
atcaatcact ctctgtgttc ttggcctcat agacacagaa acagccatga aggcagtttc 780
tgggatagtc catatgcaag cagctccaaa ggaggaatgt gccctggaga tcatcaaagg 840
gggagctctg cgccaagaag aagtgtatta tgacagctca ctctggacca ctcttctgat 900
cagaaatcca tgcaggaaga tcctggaatt tctctactca acgagctata atatggacag 960
attcataaac aagtaggaac tccctgaggg ctgggcatgc tgagggattt tgggactgtt 1020
ctgtctcatg tttatctgag ctcttatcta tgaagacatc ttcccagagt gtccccagag 1080
acatgcaagt catgggtcac acctgacaaa tggaaggagt tcctctaaca tttgcaaaat 1140
ggaaatgtaa taataatgaa tgtcatgcac cgctgcagcc agcagttgta aaattgttag 1200
taaacatagg tataattacc agatagttat attaaattta tatcttatat ataataatat 1260
gtgatgatta atacaatatt aattataata aaggtcacat aaactttata aattcataac 1320
tggtagctat aacttgagct tattcaggat ggtttcttta aaaccataaa ctgtacaaat 1380
gaaatttttc aatatttgtt tctta 1405



Claims

1. A compound of the formula (I), or pharmaceutically acceptable salts or solvates thereof,

(I)

wherein the substituents and indices have the following meanings:

R1, R2, R3 and R4 are H, halogen, hydroxy, protected hydroxy, cyano, nitro, C1 to C6 alkyl, C1 to C6 substituted
alkyl, C1 to C7 alkoxy, C1 to C7 substituted alkoxy, C1 to C7 acyl, C1 to C7 substituted acyl, C1 to C7 acyloxy,
carboxy, protected carboxy, carboxymethyl, protected carboxymethyl, hydroxymethyl, protected hydroxymethyl,
amino, protected amino, (monosubstituted)amino, protected (monosubstituted)amino, (disubstituted)amino,
carboxamide, protected carboxamide, N-(C1 to C6 alkyl)carboxamide, protected N-(C1 to C6 alkyl)carboxamide,
N,N-di(C1 to C6 alkyl)carboxamide, trifluoromethyl, N-((C1 to C6 alkyl)sulfonyl)amino, N-(phenylsulfonyl)amino
or substituted or unsubstituted phenyl,

R5 is H, C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to C8 substituted cycloalkyl, aryl,
substituted aryl, C1 to C6 alkylphenyl or C7 to C12 substituted phenylalkyl,

R6 is H, a secondary or a tertiary amine, C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to
C8 substituted cycloalkyl, aryl, substituted aryl, C1 to C6 alkylphenyl or C7 to C12 substituted phenylalkyl.

2. A compound according to claim 1, or pharmaceutically acceptable salts or solvates thereof, of formula (II)



wherein the substituents and indices have the following meanings:

R7 is C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to C8 substituted cycloalkyl, aryl,
substituted aryl, C1 to C6 alkylphenyl or C7 to C12 substituted phenylalkyl.

3. A compound according to claim 1, or pharmaceutically acceptable salts or solvates thereof, of formula (III)

(III)

wherein the substituents and indices have the following meanings:

R1, R2, R3 and R4 are H, halogen, hydroxy, protected hydroxy, cyano, nitro, C1 to C6 alkyl, C1 to C6 substituted
alkyl, C1 to C7 alkoxy, C1 to C7 substituted alkoxy, C1 to C7 acyl, C1 to C7 substituted acyl, C1 to C7 acyloxy,
carboxy, protected carboxy, carboxymethyl, protected carboxymethyl, hydroxymethyl, protected hydroxymethyl,
amino, protected amino, (monosubstituted)amino, protected (monosubstituted)amino, (disubstituted)amino,
carboxamide, protected carboxamide, N-(C1 to C6 alkyl)carboxamide, protected N-(C1 to C6 alkyl)carboxamide,
N,N-di(C1 to C6 alkyl)carboxamide, trifluoromethyl, N-((C1 to C6 alkyl)sulfonyl)amino, N-(phenylsulfonyl)amino
or phenyl,

R5 in formula (III) is H, C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to C8 substituted
cycloalkyl, aryl, substituted aryl, C1 to C6 alkylphenyl or C7 to C12 substituted phenylalkyl,

R8 in formula (III) is H, C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to C8 substituted
cycloalkyl, C1 to C7 acyl, C1 to C7 substituted acyl, aryl, substituted aryl, C1 to C6 alkylphenyl or C7 to C12
substituted phenylalkyl, carboxamide, protected carboxamide, N-(C1 to C6 alkyl)carboxamide, protected N-(C1
to C6 alkyl)carboxamide, N,N-di(C1 to C6 alkyl)carboxamide, trifluoromethyl, N-((C1 to C6 alkyl)sulfonyl)amino,
N-(phenylsulfonyl)amino or phenyl,

R9 in formula (III) is H, C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to C8 substituted
cycloalkyl, C1 to C7 acyl, C1 to C7 substituted acyl, aryl, substituted aryl, C1 to C6 alkylphenyl or C7 to C12
substituted phenylalkyl, carboxamide, protected carboxamide, N-(C1 to C6 alkyl)carboxamide, protected N-(C1
to C6 alkyl)carboxamide, N,N-di(C1 to C6 alkyl)carboxamide, trifluoromethyl, N-((C1 to C6 alkyl)sulfonyl)amino,
N-(phenylsulfonyl)amino or phenyl,

R8 and R9 in formula (III) are taken together with the N in formula (III) to form a ring of the following types:

C3 to C8 mono-, di- or tri-heteroatom-substituted cycloalkyl, preferably piperidino, morpholino, thiomorpholino,
piperazino, or C1 to C6 alkyl-, C1 to C6 substituted alkyl substituted- or benzocondensated derivatives
thereof,

C4 to C10 mono-, di- or tri-heteroatom-substituted bi- or tricycloalkyl derivatives thereof or C1 to C6 alkyl-,
C1 to C6 substituted alkyl substituted- or benzocondensated derivatives thereof,

C3 to C8 mono-, di- or tri-heteroatom-substituted heteroaryl, preferably pyrrolo, pyrazolo, oxazolo, thiazolo,
pyridino, pyridazino, pyrimidino, pyrazino, triazino, indolo, quinolino, quinazolino, or C1 to C6 alkyl-, C1 to
C6 substituted alkyl substituted- or benzocondensated derivatives thereof.

4. A compound according to claim 3 wherein

R1 and R3 are H,

R2 and R4 are H, or iodo, bromo, chloro or fluoro or hydroxy methyl, trifluoromethyl, methoxy, amino, alkylated
amino, or nitro,

R5 is H, C1 to C8 alkyl, preferably isopropyl, vinyl, 2-isobutyl, or cyclohexyl, C1 to C8 substituted alkyl, C3 to
C8 cycloalkyl, C3 to C8 substituted cycloalkyl, aryl, substituted aryl, C1 to C6 alkylphenyl or C7 to C12 substituted
phenylalkyl,

R8 and R9 in formula (III) are taken together with the N in formula (III) to form a ring of the following types:

C3 to C8 mono-, di- or tri-heteroatom-substituted cycloalkyl, preferably piperidino, morpholino, thiomorpholino,
piperazino, or C1 to C6 alkyl-, C1-C6 substituted- or benzocondensated derivatives thereof, more
preferably a 4-aryl- or 4-heteroaryl- or 4-alkyl-substituted piperazino or a 4-alkyl-substituted piperidino,

C4 to C10 mono-, di- or tri-heteroatom-substituted bi- or tricycloalkyl or C1 to C6 alkyl-, C1-C6 substituted
alkyl substituted- or benzocondensated derivatives thereof,

C3 to C8 mono-, di- or tri-heteroatom-substituted heteroaryl, preferably pyrrolo, pyrazolo, oxazolo, thiazolo,
pyridino, pyridazino, pyrimidino, pyrazino, triazino, indolo, quinolino, quinazolino or C1 to C6 alkyl-, C1 to
C6 substituted alkyl substituted- or benzocondensated derivatives thereof.

5. A compound according to any of claims 1 to 3 wherein

R1 and R3 are H,

R2 and R4 are H, or iodo, bromo, chloro or fluoro or hydroxy methyl, trifluoromethyl, methoxy, amino, alkylated
amino, or nitro,

R5 is H, C1 to C8 alkyl, preferably isopropyl, vinyl, 2-isobutyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl,
C3 to C8 substituted cycloalkyl, aryl, substituted aryl, C1 to C6 alkylphenyl or C7 to C12 substituted phenylalkyl,

R8 is a C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to C8 substituted cycloalkyl, C1 to C6
alkylphenyl or C7 to C12 substituted phenylalkyl, preferably benzyl or (furan-2-yl)-methyl or (thiophen-2-yl)-methyl,

and R9 in formula (III) is C1 to C8 alkyl, C1 to C8 substituted alkyl, C3 to C8 cycloalkyl, C3 to C8 substituted
cycloalkyl, C1 to C6 alkylphenyl or C7 to C12 substituted phenylalkyl, preferably (ethyl)-propion-1-yl.

6. A compound according to any of claims 1 to 5 being

(IV)

7. A compound according to any of claims 1 to 5 being

(V)

8. A compound according to any of claims 1 to 5 being

12. A compound according to any of claims 1 and 5 being

13. A compound according to any of claims 1 and 5 being

14. A compound according to any of claims 1 and 5 being

15. A compound according to any of claims 1 and 5 being

(XIII)



16. A compound according to any of claims 1 to 15 wherein said compound is capable of binding the NR1H3 receptor
protein encoded by a nucleic acid comprising SEQ ID NO:2 or a portion thereof or a mammalian homologue thereof.

17. A compound according to any of claims 1 to 15 wherein said compound is capable of binding the NR1H2 receptor
protein encoded by a nucleic acid comprising SEQ ID NO:1 or a portion thereof or a mammalian homologue thereof.

18. A compound according to any of claims 1 to 15 for use as a medicament. 

19. A method for prevention or treatment of a NR1H3 and/or NR1H2 receptor protein mediated disease or NR1H3
and/or NR1H2 receptor protein homologue mediated disease or condition in a mammal comprising administration
of a therapeutically effective amount of a compound according to claims 1 to 15 wherein the prevention or treatment
is directly or indirectly accomplished through the binding of the compound according to claims 1 to 15 to the NR1H3
and/or NR1H2 receptor proteins or to the NR1H3 and/or NR1H2 receptor protein homologues.

20. A method for regulating the cholesterol synthesis or transport in a mammal which comprises activating the NR1H3
and/or NR1H2 receptors with a therapeutically effective amount of a compound according to claims 1 to 15.

21. A method of treating in a mammal a disease which is affected by cholesterol, triglyceride, bile acid, glucose or
glucocorticoid levels comprising administering to a mammal in need of such treatment a therapeutically effective
amount of a compound according to claims 1 to 15.

22. A method of treating in a mammal atherosclerosis, Alzheimer's disease, type II diabetes, lipid disorders, obesity,
an inflammatory or a cardiovascular disorder comprising administering to a mammal in need of such treatment a
therapeutically effective amount of a compound according to claims 1 to 15.

23. A method according to any of claims 19 to 22, wherein the expression of one or more of the genes out of the group
comprising ABCA1, ABCG1, ABCG5, ABCG8, apolipoprotein-CI, -CII, -CIV, -E, LPL (lipoprotein lipase), CETP
(cholesteryl ester transfer protein) or other genes that positively regulate cholesterol homeostasis is increased
upon compound administration.

24. A method according to any of claims 19 to 22, wherein the expression of one or more of the genes out of the group
comprising 11-β-HSD (11-β hydroxysteroid dehydrogenase), PEPCK (phosphoenolpyruvate carboxykinase), G-6-P
(glucose-6-phosphatase) is reduced upon compound administration.

25. A method according to any of claims 19 to 22, wherein the expression of one or more of the genes out of the group
comprising FAS (Fatty Acid Synthase), SREBP-1c (Sterol-response element binding protein), SCD-1
(Stearoyl-CoA Desaturase), Angiopoietin like protein 3 (Angptl3) or other genes which are relevant for controlling
serum triglyceride or glucose levels is not increased, or is more weakly increased, in liver and/or other organs
compared to administration of a full agonist like T0901317.

26. A method of blocking in a mammal the cholesterol or fatty acid absorption in the intestine of a mammal in need of 
such blocking comprising administering to a mammal in need of such treatment a therapeutically effective amount 
of a compound as defined in any of claims 1 to 15. 

27. A method for treating obesity in a mammal comprising administering a therapeutically effective amount of a
compound as defined in any of claims 1 to 15.

28. A method of modulating a gene whose expression is regulated by the NR1H3 and/or NR1H2 receptor in a mammal
comprising administering a therapeutically effective amount of a compound as defined in any of claims 1 to 15.

29. A method according to any of claims 19 to 28 wherein the expression of ABCA1 and/or ABCG1 and/or ABCG5 
and/or ABCG8 is increased. 

30. Use of a compound as defined in any of claims 1 to 15 in a method according to claims 19 to 29 wherein the 
mammal is a human. 

31. Use of a compound as defined in any of claims 1 to 15 for the manufacture of a medicament for the prevention or
treatment of a NR1H3 and/or NR1H2 receptor protein or NR1H3 and/or NR1H2 receptor protein homologue
mediated disease or condition in a mammal wherein the prevention or treatment is directly or indirectly accomplished
through the binding of the compound to the NR1H3 and/or NR1H2 receptor protein or NR1H3 and/or NR1H2
receptor protein homologue.

32. Use of a compound as defined in any of claims 1 to 15 for the manufacture of a medicament for prevention or 
treatment of a NR1H3 and/or NR1H2 receptor protein mediated disease or condition wherein the mammal is a 
human. 

33. Use of a compound as defined in any of claims 1 to 15 for the manufacture of a medicament for regulating the
cholesterol transport system in a mammal by activating the NR1H3 and/or NR1H2 receptor.

34. Use of a compound as defined in any of claims 1 to 15 for the manufacture of a medicament for regulating levels 
of cholesterol, triglyceride, and/or bile acid. 

35. Use of a compound as defined in any of claims 1 to 15 for the manufacture of a medicament for treating in a
mammal atherosclerosis, Alzheimer's disease, gallstone disease, lipid disorders, inflammatory disorder, type II
diabetes, obesity or a cardiovascular disorder.

36. Use of a compound as defined in any of claims 1 to 15 for the manufacture of a medicament capable of blocking
in a mammal the cholesterol and/or fatty acid absorption in the intestine.

37. Use of a compound as defined in any of claims 1 to 15 for the manufacture of a medicament for treating obesity 
in a mammal. 

38. Use of a compound as defined in any of claims 1 to 15 for the manufacture of a medicament for modulating a gene
whose expression is regulated by the NR1H3 and/or NR1H2 receptor.

39. Use of a compound as defined in any of claims 1 to 15 in a mammal for the selective up-regulation of one or more
genes selected from the group comprising ABCA1, ABCG1, ABCG5, ABCG8, apolipoprotein-CI, -CII, -CIV, -E,
LPL (lipoprotein lipase), CETP (cholesteryl ester transfer protein) or other genes that positively regulate cholesterol
homeostasis, and a weaker regulation of one or more of the genes selected from the group comprising
FAS and SREBP-1c or other genes that positively regulate lipogenesis, said compound showing a larger difference
in regulation of the two groups of genes when compared with the regulatory behaviour of a full agonist like T0901317
on both groups of genes.

40. The use according to any of claims 31 to 39, wherein the mammal is human.

Fig. 2

Fig. 3

Fig. 4

Fig. 5

Fig. 6